TADG-15: AN EXTRACELLULAR SERINE PROTEASE
OVEREXPRESSED IN BREAST AND OVARIAN CARCINOMAS

BACKGROUND OF THE INVENTION 

Field of the Invention

The present invention relates generally to the fields of 
cellular biology and the diagnosis of neoplastic disease. More 
specifically, the present invention relates to an extracellular serine 
protease termed Tumor Antigen Derived Gene-15 (TADG-15), which is
overexpressed in breast and ovarian carcinomas.

Description of the Related Art 

Extracellular proteases have been directly associated with
tumor growth, shedding of tumor cells, and invasion of target organs.
Individual classes of proteases are involved in, but not limited to, (1)
the digestion of stroma surrounding the initial tumor area; (2) the
digestion of the cellular adhesion molecules to allow dissociation of
tumor cells; and (3) the invasion of the basement membrane for
metastatic growth and the activation of both tumor growth factors
and angiogenic factors.

The prior art is deficient in the lack of effective means of
screening to identify proteases overexpressed in carcinoma. The
present invention fulfills this longstanding need and desire in the art.

SUMMARY OF THE INVENTION 

The present invention discloses a screening program to
identify proteases overexpressed in carcinoma by examining PCR
products amplified using differential display in early stage tumors
and metastatic tumors compared to those of normal tissues.

In one embodiment of the present invention, there is
provided a DNA encoding a TADG-15 protein selected from the group
consisting of: (a) isolated DNA which encodes a TADG-15 protein; (b)
isolated DNA which hybridizes to isolated DNA of (a) above and which
encodes a TADG-15 protein; and (c) isolated DNA differing from the
isolated DNAs of (a) and (b) above in codon sequence due to the
degeneracy of the genetic code, and which encodes a TADG-15
protein.

In another embodiment of the present invention, there is
provided a vector capable of expressing the DNA of the present
invention adapted for expression in a recombinant cell and regulatory
elements necessary for expression of the DNA in the cell.

In yet another embodiment of the present invention,
there is provided a host cell transfected with the vector of the present
invention, the vector expressing a TADG-15 protein.

In still yet another embodiment of the present invention,
there is provided a method of detecting expression of a TADG-15
mRNA, comprising the steps of: (a) contacting mRNA obtained from
the cell with the labeled hybridization probe; and (b) detecting
hybridization of the probe with the mRNA.

Other and further aspects, features, and advantages of the 
present invention will be apparent from the following description of 
the presently preferred embodiments of the invention given for the 
purpose of disclosure. 


BRIEF DESCRIPTION OF THE DRAWINGS 

So that the matter in which the above-recited features,
advantages and objects of the invention, as well as others which will
become clear, are attained and can be understood in detail, more
particular descriptions of the invention briefly summarized above
may be had by reference to certain embodiments thereof which are
illustrated in the appended drawings. These drawings form a part of
the specification. It is to be noted, however, that the appended
drawings illustrate preferred embodiments of the invention and
therefore are not to be considered limiting in their scope.

Figure 1 shows a comparison of PCR products derived
from normal and breast carcinoma cDNA as shown by staining in an
agarose gel.

Figure 2 shows a comparison of the serine protease
catalytic domain of TADG-15 (SEQ ID No: 14) with hepsin (Heps, SEQ
ID No: 3), (Scce, SEQ ID No: 4), trypsin (Try, SEQ ID No: 5),
chymotrypsin (Chymb, SEQ ID No: 6), factor 7 (Fac7, SEQ ID No:
7) and tissue plasminogen activator (Tpa, SEQ ID No: 8). The asterisks
indicate conserved amino acids of the catalytic triad.

Figure 3 shows quantitative PCR analysis of TADG-15
expression.

Figure 4 shows the ratio of TADG-15 expression to
expression of β-tubulin in normal tissues, low malignant potential
tumors (LMP) and carcinomas.

Figure 5 shows the TADG-15 expression in tumor cell
lines derived from both ovarian and breast carcinoma tissues.

Figure 6 shows the overexpression of TADG-15 in other
tumor tissues.

Figure 7 shows the Northern blots of TADG-15 expression
in ovarian carcinomas, fetal and normal adult tissues.

Figure 8 shows a diagram of the TADG-15 transcript and
the clones with the origin of their derivation.

Figure 9 shows the nucleotide sequence of the TADG-15
cDNA (SEQ ID No: 1) and the amino acid sequence of the TADG-15
protein (SEQ ID No: 2).

Figure 10 shows the amino acid sequence of the TADG-15
protease including functional sites and domains.

Figure 11 shows a structure diagram of the TADG-15
protein including functional domains.

Figure 12 shows a nucleotide sequence comparison
between TADG-15 and human SNC-19 (GenBank accession #1120428).

DETAILED DESCRIPTION OF THE INVENTION 



As used herein, the term "cDNA" shall refer to the DNA
copy of the mRNA transcript of a gene.

As used herein, the term "derived amino acid sequence"
shall mean the amino acid sequence determined by reading the triplet
sequence of nucleotide bases in the cDNA.
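
As a rough illustration of this definition, the following Python sketch reads a coding sequence three bases at a time using the standard genetic code and returns the derived amino acid sequence in one-letter symbols. The function name `derive_amino_acids` and the two short example sequences are hypothetical and are not taken from the TADG-15 cDNA; the sketch also assumes that the input begins in frame at the start codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
# "*" marks a translation stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def derive_amino_acids(cdna: str) -> str:
    """Read the cDNA in triplets and return the derived amino acid sequence.

    Translation stops at the first stop codon; a trailing partial codon
    is ignored.
    """
    seq = cdna.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Two hypothetical sequences that differ in codon sequence but, because of
# the degeneracy of the genetic code, encode the same peptide.
print(derive_amino_acids("ATGGGCTACTAA"))  # MGY
print(derive_amino_acids("ATGGGTTATTGA"))  # MGY
```

The second call shows why DNAs that differ only by synonymous codons are treated together: both inputs yield the same derived amino acid sequence.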

As used herein, the term "screening a library" shall refer
to the process of using a labeled probe to check whether, under the
appropriate conditions, there is a sequence complementary to the
probe present in a particular DNA library. In addition, "screening a
library" could be performed by PCR.

As used herein, the term "PCR" refers to the polymerase
chain reaction that is the subject of U.S. Patent Nos. 4,683,195 and
4,683,202 to Mullis, as well as other improvements now known in the
art.

The TADG-15 cDNA is 3147 base pairs long (SEQ ID No:1)
and encodes an 855 amino acid protein (SEQ ID No:2). The
availability of the TADG-15 gene opens the way for a number of studies
that can lead to various applications. For example, the TADG-15 gene
can be used as a diagnostic or therapeutic target in ovarian carcinoma
and other carcinomas including breast, prostate, lung, and colon.

In accordance with the present invention there may be
employed conventional molecular biology, microbiology, and
recombinant DNA techniques within the skill of the art. Such
techniques are explained fully in the literature. See, e.g., Maniatis,
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual" (1982);
"DNA Cloning: A Practical Approach," Volumes I and II (D.N. Glover ed.
1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid
Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)];
"Transcription and Translation" [B.D. Hames & S.J. Higgins eds. (1984)];
"Animal Cell Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells
And Enzymes" [IRL Press, (1986)]; B. Perbal, "A Practical Guide To
Molecular Cloning" (1984).

Therefore, if appearing herein, the following terms shall
have the definitions set out below.

The amino acids described herein are preferred to be in the
"L" isomeric form. However, residues in the "D" isomeric form can be
substituted for any L-amino acid residue, as long as the desired
functional property of immunoglobulin-binding is retained by the
polypeptide. NH2 refers to the free amino group present at the amino
terminus of a polypeptide. COOH refers to the free carboxy group
present at the carboxy terminus of a polypeptide. In keeping with
standard polypeptide nomenclature, J. Biol. Chem., 243:3552-59
(1969), abbreviations for amino acid residues are shown in the
following Table of Correspondence:



TABLE OF CORRESPONDENCE

      SYMBOL
1-Letter    3-Letter    AMINO ACID
Y           Tyr         tyrosine
G           Gly         glycine
F           Phe         phenylalanine
M           Met         methionine
A           Ala         alanine
S           Ser         serine
I           Ile         isoleucine
L           Leu         leucine
T           Thr         threonine
V           Val         valine
P           Pro         proline
K           Lys         lysine
H           His         histidine
Q           Gln         glutamine
E           Glu         glutamic acid
W           Trp         tryptophan
R           Arg         arginine
D           Asp         aspartic acid
N           Asn         asparagine
C           Cys         cysteine

It should be noted that all amino-acid residue sequences
are represented herein by formulae whose left and right orientation
is in the conventional direction of amino-terminus to carboxy-
terminus. Furthermore, it should be noted that a dash at the
beginning or end of an amino acid residue sequence indicates a
peptide bond to a further sequence of one or more amino-acid
residues. The above Table is presented to correlate the three-letter
and one-letter notations which may appear alternately herein.
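
For readers who handle sequences programmatically, the Table of Correspondence can be held as a simple lookup. The sketch below is only an illustration; the names `ONE_TO_THREE` and `to_three_letter` are hypothetical, and the hyphen-separated output is an assumed display format, written amino-terminus first as described above.

```python
# One-letter to three-letter symbols, in the order of the Table of
# Correspondence.
ONE_TO_THREE = {
    "Y": "Tyr", "G": "Gly", "F": "Phe", "M": "Met", "A": "Ala",
    "S": "Ser", "I": "Ile", "L": "Leu", "T": "Thr", "V": "Val",
    "P": "Pro", "K": "Lys", "H": "His", "Q": "Gln", "E": "Glu",
    "W": "Trp", "R": "Arg", "D": "Asp", "N": "Asn", "C": "Cys",
}


def to_three_letter(sequence: str) -> str:
    """Convert a one-letter residue sequence to three-letter notation."""
    return "-".join(ONE_TO_THREE[residue] for residue in sequence.upper())


print(to_three_letter("MGY"))  # Met-Gly-Tyr
```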

A "replicon" is any genetic element (e.g., plasmid,
chromosome, virus) that functions as an autonomous unit of DNA
replication in vivo; i.e., capable of replication under its own
control.

A "vector" is a replicon, such as plasmid, phage or cosmid,
to which another DNA segment may be attached so as to bring about
the replication of the attached segment.

A "DNA molecule" refers to the polymeric form of
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in
either its single-stranded form or a double-stranded helix. This term
refers only to the primary and secondary structure of the molecule,
and does not limit it to any particular tertiary forms. Thus, this term
includes double-stranded DNA found, inter alia, in linear DNA
molecules (e.g., restriction fragments), viruses, plasmids, and
chromosomes. In discussing the structure herein, sequences are
described according to the normal convention of giving only the
sequence in the 5' to 3' direction along the nontranscribed strand of
DNA (i.e., the strand having a sequence homologous to the mRNA).

An "origin of replication" refers to those DNA sequences
that participate in DNA synthesis.

A DNA "coding sequence" is a double-stranded DNA
sequence which is transcribed and translated into a polypeptide in
vivo when placed under the control of appropriate regulatory
sequences. The boundaries of the coding sequence are determined by
a start codon at the 5' (amino) terminus and a translation stop codon
at the 3' (carboxyl) terminus. A coding sequence can include, but is
not limited to, prokaryotic sequences, cDNA from eukaryotic mRNA,
genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and
even synthetic DNA sequences. A polyadenylation signal and
transcription termination sequence will usually be located 3' to the
coding sequence.

Transcriptional and translational control sequences are
DNA regulatory sequences, such as promoters, enhancers,
polyadenylation signals, terminators, and the like, that provide for the
expression of a coding sequence in a host cell.

A "promoter sequence" is a DNA regulatory region capable
of binding RNA polymerase in a cell and initiating transcription of a
downstream (3' direction) coding sequence. For purposes of defining
the present invention, the promoter sequence is bounded at its 3'
terminus by the transcription initiation site and extends upstream (5'
direction) to include the minimum number of bases or elements
necessary to initiate transcription at levels detectable above
background. Within the promoter sequence will be found a
transcription initiation site, as well as protein binding domains
(consensus sequences) responsible for the binding of RNA polymerase.
Eukaryotic promoters often, but not always, contain "TATA" boxes and
"CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno
sequences in addition to the -10 and -35 consensus sequences.

An "expression control sequence" is a DNA sequence that
controls and regulates the transcription and translation of another
DNA sequence. A coding sequence is "under the control" of
transcriptional and translational control sequences in a cell when RNA
polymerase transcribes the coding sequence into mRNA, which is then
translated into the protein encoded by the coding sequence.

A "signal sequence" can be included near the coding
sequence. This sequence encodes a signal peptide, N-terminal to the
polypeptide, that communicates to the host cell to direct the
polypeptide to the cell surface or secrete the polypeptide into the
media, and this signal peptide is clipped off by the host cell
before the protein leaves the cell. Signal sequences can be found
associated with a variety of proteins native to prokaryotes and
eukaryotes.

The term "oligonucleotide", as used herein in referring to
the probe of the present invention, is defined as a molecule comprised
of two or more ribonucleotides, preferably more than three. Its exact
size will depend upon many factors which, in turn, depend upon the
ultimate function and use of the oligonucleotide.

The term "primer" as used herein refers to an
oligonucleotide, whether occurring naturally as in a purified
restriction digest or produced synthetically, which is capable of acting
as a point of initiation of synthesis when placed under conditions in
which synthesis of a primer extension product, which is
complementary to a nucleic acid strand, is induced, i.e., in the
presence of nucleotides and an inducing agent such as a DNA
polymerase and at a suitable temperature and pH. The primer may
be either single-stranded or double-stranded and must be sufficiently
long to prime the synthesis of the desired extension product in the
presence of the inducing agent. The exact length of the primer will
depend upon many factors, including temperature, source of primer
and use of the method. For example, for diagnostic applications,
depending on the complexity of the target sequence, the
oligonucleotide primer typically contains 15-25 or more nucleotides,
although it may contain fewer nucleotides.

The primers herein are selected to be "substantially"
complementary to different strands of a particular target DNA
sequence. This means that the primers must be sufficiently
complementary to hybridize with their respective strands.
Therefore, the primer sequence need not reflect the exact sequence of
the template. For example, a non-complementary nucleotide
fragment may be attached to the 5' end of the primer, with the
remainder of the primer sequence being complementary to the
strand. Alternatively, non-complementary bases or longer sequences
can be interspersed into the primer, provided that the primer
sequence has sufficient complementarity with the sequence to
hybridize therewith and thereby form the template for the synthesis
of the extension product.

As used herein, the terms "restriction endonucleases" and
"restriction enzymes" refer to enzymes, each of which cut double-
stranded DNA at or near a specific nucleotide sequence.
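
To make the idea of cutting at a specific nucleotide sequence concrete, the following Python sketch locates recognition sites on one strand and splits the sequence at each site. It is a simplified model under stated assumptions: the default site GAATTC with a cut after the first base is the well-known EcoRI pattern and is used here only as a familiar example (the text above names no particular enzyme), only the strand shown is modeled, and the function names and test sequence are hypothetical.

```python
def find_cut_positions(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> list[int]:
    """Return the index at which each occurrence of `site` would be cut.

    `cut_offset` is the number of bases into the site where the cut
    falls on the strand shown (1 for G^AATTC).
    """
    seq = dna.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + cut_offset)
        start = seq.find(site, start + 1)
    return positions


def digest(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> list[str]:
    """Split the sequence into fragments at every cut position."""
    seq = dna.upper()
    bounds = [0] + find_cut_positions(seq, site, cut_offset) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


# Hypothetical 18-nucleotide sequence containing two recognition sites.
print(find_cut_positions("AAGAATTCTTGAATTCGG"))  # [3, 11]
print(digest("AAGAATTCTTGAATTCGG"))  # ['AAG', 'AATTCTTG', 'AATTCGG']
```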

A cell has been "transformed" by exogenous or
heterologous DNA when such DNA has been introduced inside the cell.
The transforming DNA may or may not be integrated (covalently
linked) into the genome of the cell. In prokaryotes, yeast, and
mammalian cells, for example, the transforming DNA may be
maintained on an episomal element such as a plasmid. With respect
to eukaryotic cells, a stably transformed cell is one in which the
transforming DNA has become integrated into a chromosome so that it
is inherited by daughter cells through chromosome replication. This
stability is demonstrated by the ability of the eukaryotic cell to
establish cell lines or clones comprised of a population of daughter
cells containing the transforming DNA. A "clone" is a population of
cells derived from a single cell or ancestor by mitosis. A "cell line" is
a clone of a primary cell that is capable of stable growth in vitro for
many generations.

Two DNA sequences are "substantially homologous"
when at least about 75% (preferably at least about 80%, and most
preferably at least about 90% or 95%) of the nucleotides match over
the defined length of the DNA sequences. Sequences that are
substantially homologous can be identified by comparing the
sequences using standard software available in sequence data banks,
or in a Southern hybridization experiment under, for example,
stringent conditions as defined for that particular system. Defining
appropriate hybridization conditions is within the skill of the art. See,
e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; Nucleic
Acid Hybridization, supra.

A "heterologous" region of the DNA construct is an
identifiable segment of DNA within a larger DNA molecule that is not
found in association with the larger molecule in nature. Thus, when
the heterologous region encodes a mammalian gene, the gene will
usually be flanked by DNA that does not flank the mammalian
genomic DNA in the genome of the source organism. In another
example, the coding sequence is a construct where the coding sequence
itself is not found in nature (e.g., a cDNA where the genomic coding
sequence contains introns, or synthetic sequences having codons
different than the native gene). Allelic variations or naturally-
occurring mutational events do not give rise to a heterologous region
of DNA as defined herein.

The labels most commonly employed for these studies are
radioactive elements, enzymes, chemicals which fluoresce when
exposed to ultraviolet light, and others. A number of fluorescent
materials are known and can be utilized as labels. These include, for
example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue
and Lucifer Yellow. A particular detecting material is anti-rabbit
antibody prepared in goats and conjugated with fluorescein through
an isothiocyanate.

Proteins can also be labeled with a radioactive element or
with an enzyme. The radioactive label can be detected by any of the
currently available counting procedures. The preferred isotope may
be selected from ³H, ¹⁴C, ³²P, ³⁵S, ³⁶Cl, ⁵¹Cr, ⁵⁷Co, ⁵⁸Co, ⁵⁹Fe, ⁹⁰Y, ¹²⁵I,
¹³¹I, and ¹⁸⁶Re.

Enzyme labels are likewise useful, and can be detected by
any of the presently utilized colorimetric, spectrophotometric,
fluorospectrophotometric, amperometric or gasometric techniques.
The enzyme is conjugated to the selected particle by reaction with
bridging molecules such as carbodiimides, diisocyanates,
glutaraldehyde and the like. Many enzymes which can be used in
these procedures are known and can be utilized. The preferred are
peroxidase, β-glucuronidase, β-D-glucosidase, β-D-galactosidase,
urease, glucose oxidase plus peroxidase and alkaline phosphatase.
U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to
by way of example for their disclosure of alternate labeling material
and methods.

A particular assay system developed and utilized in the
art is known as a receptor assay. In a receptor assay, the material to
be assayed is appropriately labeled and then certain cellular test
colonies are inoculated with a quantity of both the labeled and
unlabeled material, after which binding studies are conducted to
determine the extent to which the labeled material binds to the cell
receptors. In this way, differences in affinity between materials can
be ascertained.

An assay useful in the art is known as a "cis/trans" assay.
Briefly, this assay employs two genetic constructs, one of which
is typically a plasmid that continually expresses a particular receptor
of interest when transfected into an appropriate cell line, and the
second of which is a plasmid that expresses a reporter such as
luciferase, under the control of a receptor/ligand complex. Thus, for
example, if it is desired to evaluate a compound as a ligand for a
particular receptor, one of the plasmids would be a construct that
results in expression of the receptor in the chosen cell line, while the
second plasmid would possess a promoter linked to the luciferase
gene in which the response element to the particular receptor is
inserted. If the compound under test is an agonist for the receptor,
the ligand will complex with the receptor, and the resulting complex
will bind the response element and initiate transcription of the
luciferase gene. The resulting chemiluminescence is then measured
photometrically, and dose response curves are obtained and
compared to those of known ligands. The foregoing protocol is
described in detail in U.S. Patent No. 4,981,784.

As used herein, the term "host" is meant to include not
only prokaryotes but also eukaryotes such as yeast, plant and animal
cells. A recombinant DNA molecule or gene which encodes a human
TADG-15 protein of the present invention can be used to transform a
host using any of the techniques commonly known to those of
ordinary skill in the art. Especially preferred is the use of a vector
containing coding sequences for the gene which encodes a human
TADG-15 protein of the present invention for purposes of prokaryote
transformation. Prokaryotic hosts may include E. coli, S.
typhimurium, Serratia marcescens and Bacillus subtilis. Eukaryotic
hosts include yeasts such as Pichia pastoris, mammalian cells and
insect cells.

In general, expression vectors containing promoter
sequences which facilitate the efficient transcription of the inserted
DNA fragment are used in connection with the host. The expression
vector typically contains an origin of replication, promoter(s),
terminator(s), as well as specific genes which are capable of providing
phenotypic selection in transformed cells. The transformed hosts can
be fermented and cultured according to means known in the art to
achieve optimal cell growth.

The invention includes a substantially pure DNA encoding
a TADG-15 protein, a strand of which DNA will hybridize at high
stringency to a probe containing a sequence of at least 15
consecutive nucleotides of (SEQ ID NO:1). The protein encoded by the
DNA of this invention may share at least 80% sequence identity
(preferably 85%, more preferably 90%, and most preferably 95%)
with the amino acids listed in Figure 10 (SEQ ID NO:2). More
preferably, the DNA includes the coding sequence of the nucleotides
of Figure 9 (SEQ ID NO:1), or a degenerate variant of such a sequence.

The probe to which the DNA of the invention hybridizes
preferably consists of a sequence of at least 20 consecutive
nucleotides, more preferably 40 nucleotides, even more preferably
50 nucleotides, and most preferably 100 nucleotides or more (up to
100%) of the coding sequence of the nucleotides listed in Figure 9
(SEQ ID NO:1) or the complement thereof. Such a probe is useful for
detecting expression of TADG-15 in a human cell by a method
including the steps of (a) contacting mRNA obtained from the cell
with the labeled hybridization probe; and (b) detecting hybridization
of the probe with the mRNA.

This invention also includes a substantially pure
DNA containing a sequence of at least 15 consecutive nucleotides
(preferably 20, more preferably 30, even more preferably 50, and
most preferably all) of the region from nucleotides 1 to 3147 of the
nucleotides listed in Figure 9 (SEQ ID NO:1).

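The "at least 15 consecutive nucleotides" criterion can be checked mechanically with a sliding window. The Python sketch below is a minimal illustration, not a claim-construction tool: the function name `contains_consecutive_run` and the 30-nucleotide `REFERENCE` string are hypothetical (the string is invented and is not part of SEQ ID NO:1), and only exact matches on the strand shown are considered.

```python
def contains_consecutive_run(candidate: str, reference: str, min_len: int = 15) -> bool:
    """Return True if `candidate` contains at least `min_len` consecutive
    nucleotides that also occur, in the same order, in `reference`."""
    cand = candidate.upper()
    ref = reference.upper()
    # Slide a window of min_len bases along the candidate sequence.
    for start in range(len(cand) - min_len + 1):
        if cand[start:start + min_len] in ref:
            return True
    return False


# Hypothetical 30-nucleotide reference, invented for this example only.
REFERENCE = "ATGGCCTTAGCAGGTCCATTGACCGGTAAC"

# 15 consecutive reference nucleotides flanked by unrelated bases.
print(contains_consecutive_run("TTTT" + REFERENCE[5:20] + "CCCC", REFERENCE))  # True
# Only 14 consecutive reference nucleotides are shared.
print(contains_consecutive_run("GGGG" + REFERENCE[5:19] + "AAAA", REFERENCE))  # False
```

Raising `min_len` to 20, 30, or 50 corresponds to the progressively preferred lengths recited above.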
By "high stringency" is meant DNA hybridization and
wash conditions characterized by high temperature and low salt
concentration, e.g., wash conditions at a salt concentration of
approximately 0.1 x SSC, or the functional equivalent thereof. For
example, high stringency conditions may include hybridization at
about 42°C in the presence of about 50% formamide; a first wash at
about 65°C with about 2 x SSC containing 1% SDS; followed by a
second wash at about 65°C with about 0.1 x SSC.

By "substantially pure DNA" is meant DNA that is not part
of a milieu in which the DNA naturally occurs, by virtue of separation
(partial or total purification) of some or all of the molecules of that
milieu, or by virtue of alteration of sequences that flank the claimed
DNA. The term therefore includes, for example, a recombinant DNA
which is incorporated into a vector, into an autonomously replicating
plasmid or virus, or into the genomic DNA of a prokaryote or
eukaryote; or which exists as a separate molecule (e.g., a cDNA or a
genomic or cDNA fragment produced by polymerase chain reaction
(PCR) or restriction endonuclease digestion) independent of other
sequences. It also includes a recombinant DNA which is part of a
hybrid gene encoding additional polypeptide sequence, e.g., a fusion
protein. Also included is a recombinant DNA which includes a
portion of the nucleotides listed in Figure 9 (SEQ ID NO:1) which
encodes an alternative splice variant of TADG-15.

The DNA may have at least about 70% sequence
identity to the coding sequence of the nucleotides listed in Figure 9
(SEQ ID NO:1), preferably at least 75% (e.g. at least 80%); and most
preferably at least 90%. The identity between two sequences is a
direct function of the number of matching or identical positions.
When a subunit position in both of the two sequences is occupied by
the same monomeric subunit, e.g., if a given position is occupied by
an adenine in each of two DNA molecules, then they are identical at
that position. For example, if 7 positions in a sequence
10 nucleotides in length are identical to the corresponding positions
in a second 10-nucleotide sequence, then the two sequences have
70% sequence identity. The length of comparison sequences will
generally be at least 50 nucleotides, preferably at least 60
nucleotides, more preferably at least 75 nucleotides, and most
preferably 100 nucleotides. Sequence identity is typically measured
using sequence analysis software (e.g., Sequence Analysis Software
Package of the Genetics Computer Group, University of Wisconsin
Biotechnology Center, 1710 University Avenue, Madison, WI 53705).
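
The position-by-position definition above reduces to a short calculation. The following Python sketch reproduces the 7-of-10 example; it assumes two sequences of equal length that are already aligned without gaps, whereas sequence analysis packages such as the one cited perform the alignment themselves. The function name `percent_identity` and the two 10-nucleotide strings are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions occupied by the same nucleotide in two
    equal-length, pre-aligned sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Two hypothetical 10-nucleotide sequences that agree at the first
# 7 positions and differ at the last 3.
print(percent_identity("ACGTACGTAC", "ACGTACGAGT"))  # 70.0
```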

The present invention comprises a vector comprising a
DNA sequence which encodes a human TADG-15 protein and said
vector is capable of replication in a host which comprises, in operable
linkage: a) an origin of replication; b) a promoter; and c) a DNA
sequence coding for said protein. Preferably, the vector of the
present invention contains a portion of the DNA sequence shown in
SEQ ID No: 1. A "vector" may be defined as a replicable nucleic acid
construct, e.g., a plasmid or viral nucleic acid. Vectors may be used
to amplify and/or express nucleic acid encoding TADG-15 protein.
An expression vector is a replicable construct in which a nucleic acid
sequence encoding a polypeptide is operably linked to suitable
control sequences capable of effecting expression of the polypeptide
in a cell. The need for such control sequences will vary depending
upon the cell selected and the transformation method chosen.
Generally, control sequences include a transcriptional promoter
and/or enhancer, suitable mRNA ribosomal binding sites, and
sequences which control the termination of transcription and
translation. Methods which are well known to those skilled in the art
can be used to construct expression vectors containing appropriate
transcriptional and translational control signals. See, for example, the
techniques described in Sambrook et al., 1989, Molecular Cloning: A
Laboratory Manual (2nd Ed.), Cold Spring Harbor Press, N.Y. A gene
and its transcription control sequences are defined as being
"operably linked" if the transcription control sequences effectively
control the transcription of the gene. Vectors of the invention
include, but are not limited to, plasmid vectors and viral vectors.
Preferred viral vectors of the invention are those derived from
retroviruses, adenovirus, adeno-associated virus, SV40 virus, or
herpes viruses.

By a "substantially pure protein" is meant a protein
which has been separated from at least some of those components
which naturally accompany it. Typically, the protein is substantially
pure when it is at least 60%, by weight, free from the proteins and
other naturally-occurring organic molecules with which it is
naturally associated in vivo. Preferably, the purity of the
preparation is at least 75%, more preferably at least 90%, and most
preferably at least 99%, by weight. A substantially pure TADG-15
protein may be obtained, for example, by extraction from a natural
source; by expression of a recombinant nucleic acid
encoding a TADG-15 polypeptide; or by chemically synthesizing the
protein. Purity can be measured by any appropriate method, e.g.,
column chromatography such as immunoaffinity chromatography
using an antibody specific for TADG-15, polyacrylamide gel
electrophoresis, or HPLC analysis. A protein is substantially free of
naturally associated components when it is separated from at least
some of those contaminants which accompany it in its natural state.
Thus, a protein which is chemically synthesized or produced in a
cellular system different from the cell from which it naturally
originates will be, by definition, substantially free from its naturally
associated components. Accordingly, substantially pure proteins
include eukaryotic proteins synthesized in E. coli, other prokaryotes,
or any other organism in which they do not naturally occur.

In addition to substantially full-length proteins, the
invention also includes fragments (e.g., antigenic fragments) of the
TADG-15 protein (SEQ ID No:2). As used herein, "fragment," as
applied to a polypeptide, will ordinarily be at least 10 residues,
more typically at least 20 residues, and preferably at least 30 (e.g.,
50) residues in length, but less than the entire, intact sequence.
Fragments of the TADG-15 protein can be generated by methods
known to those skilled in the art, e.g., by enzymatic digestion of
naturally occurring or recombinant TADG-15 protein, by recombinant
DNA techniques using an expression vector that encodes a defined
fragment of TADG-15, or by chemical synthesis. The ability of a
candidate fragment to exhibit a characteristic of TADG-15 (e.g.,
binding to an antibody specific for TADG-15) can be assessed by
methods described herein. Purified TADG-15 or antigenic fragments
of TADG-15 can be used to generate new antibodies or to
test existing antibodies (e.g., as positive controls in a diagnostic
assay) by employing standard protocols known to those skilled in the
art. Included in this invention are polyclonal antisera generated by
using TADG-15 or a fragment of TADG-15 as the immunogen in, e.g.,
rabbits. Standard protocols for monoclonal and polyclonal antibody
production known to those skilled in this art are employed. The
monoclonal antibodies generated by this procedure can be screened
for the ability to identify recombinant TADG-15 cDNA clones, and to
distinguish them from known cDNA clones.
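
The length limits given above for a "fragment" can be expressed as a simple check. The following is a minimal sketch only: the function name, the toy sequence, and the requirement that a fragment be a contiguous stretch of the full-length sequence are assumptions made for illustration and are not taken from the specification.

```python
def is_fragment(candidate: str, full_length: str, min_len: int = 10) -> bool:
    """Return True if `candidate` is at least `min_len` residues long,
    shorter than the intact sequence, and (assumption) a contiguous
    stretch of `full_length`."""
    return (
        min_len <= len(candidate) < len(full_length)
        and candidate in full_length
    )


# Hypothetical 30-residue sequence, used only to exercise the check.
toy_protein = "ACDEFGHIKLMNPQRSTVWYACDEFGHIKL"

print(is_fragment(toy_protein[5:20], toy_protein))  # 15 residues
print(is_fragment(toy_protein[:5], toy_protein))    # only 5 residues
print(is_fragment(toy_protein, toy_protein))        # the entire sequence
```

Raising `min_len` to 20 or 30 corresponds to the "more typically" and "preferably" limits recited above.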

Further included in this invention are TADG-15 proteins
which are encoded at least in part by portions of SEQ ID No:2, e.g.,
products of alternative mRNA splicing or alternative protein
processing events, or in which a section of TADG-15 sequence has
been deleted. The fragment, or the intact TADG-15 polypeptide, may
be covalently linked to another polypeptide, e.g., one which acts as a
label, a ligand or a means to increase antigenicity.

The invention also includes a polyclonal or monoclonal
antibody which specifically binds to TADG-15. The invention
encompasses not only an intact monoclonal antibody, but also an
immunologically-active antibody fragment, e.g., a Fab or (Fab)2
fragment; an engineered single chain Fv molecule; or a chimeric
molecule, e.g., an antibody which contains the binding specificity of
one antibody, e.g., of murine origin, and the remaining portions of
another antibody, e.g., of human origin.

In one embodiment, the antibody, or a fragment thereof,
may be linked to a toxin or to a detectable label, e.g., a radioactive
label, non-radioactive isotopic label, fluorescent label,
chemiluminescent label, paramagnetic label, enzyme
label, or colorimetric label. Examples of suitable toxins include
diphtheria toxin, Pseudomonas exotoxin A, ricin, and cholera toxin.
Examples of suitable enzyme labels include malate dehydrogenase,
staphylococcal nuclease, delta-5-steroid isomerase, alcohol
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose
phosphate isomerase, peroxidase, alkaline phosphatase, asparaginase,
glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase,
glucose-6-phosphate dehydrogenase, glucoamylase,
acetylcholinesterase, etc. Examples of suitable radioisotopic labels
include 3H, 125I, 131I, 32P, 35S, 14C, etc.

Paramagnetic isotopes for purposes of in vivo diagnosis
can also be used according to the methods of this invention. There
are numerous examples of elements that are useful in magnetic
resonance imaging. For discussions on in vivo nuclear magnetic
resonance imaging, see, for example, Schaefer et al., (1989) JACC 14,
472-480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G.
L., (1984) Physiol. Chem. Phys. Med. NMR 16, 93-95; Wesbey et al.,
(1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al.,
(1984) Invest. Radiol. 19, 408-415. Examples of suitable fluorescent
labels include a fluorescein label, an isothiocyanate label, a
rhodamine label, a phycoerythrin label, a phycocyanin label, an
allophycocyanin label, an o-phthaldehyde label, a fluorescamine label,
etc. Examples of chemiluminescent labels include a luminol label, an
isoluminol label, an aromatic acridinium ester label, an imidazole
label, an acridinium salt label, an oxalate ester label, a luciferin label,
a luciferase label, an aequorin label, etc.

Those of ordinary skill in the art will know of other
suitable labels which may be employed in accordance with
the present invention. The binding of these labels to antibodies or
fragments thereof can be accomplished using standard techniques
commonly known to those of ordinary skill in the art. Typical
techniques are described by Kennedy et al., (1976) Clin. Chim. Acta
70, 1-31; and Schurs et al., (1977) Clin. Chim. Acta 81, 1-40. Coupling
techniques mentioned in the latter are the glutaraldehyde method,
the periodate method, the dimaleimide method, and the
m-maleimidobenzyl-N-hydroxy-succinimide ester method. All of these
methods are incorporated by reference herein.

Also within the invention is a method of detecting TADG-15
protein in a biological sample, which includes the steps of
contacting the sample with a labeled antibody, e.g., a radioactively
tagged antibody specific for TADG-15, and determining whether the
antibody binds to a component of the sample.

As described herein, the invention provides a number of
diagnostic advantages and uses. For example, the TADG-15 protein is
useful in diagnosing cancer in different tissues since this protein is
highly overexpressed in tumor cells. Antibodies (or antigen-binding
fragments thereof) which bind to an epitope specific for TADG-15
are useful in a method of detecting TADG-15 protein in a biological
sample for diagnosis of cancerous or neoplastic transformation. This
method includes the steps of obtaining a biological sample (e.g., cells,
blood, plasma, tissue, etc.) from a patient suspected of having cancer,
contacting the sample with a labeled antibody (e.g., radioactively
tagged antibody) specific for TADG-15, and detecting the TADG-15
protein using standard immunoassay techniques such as an ELISA.
Antibody binding to the biological sample indicates that the sample
contains a component which specifically binds to an epitope
within TADG-15.

Likewise, a standard Northern blot assay can be used to
ascertain the relative amounts of TADG-15 mRNA in a cell or tissue
obtained from a patient suspected of having cancer, in accordance
with conventional Northern hybridization techniques known to those
of ordinary skill in the art. This Northern assay uses a hybridization
probe, e.g., radiolabelled TADG-15 cDNA, either containing the
full-length, single stranded DNA having a sequence complementary to
SEQ ID No:1 (Figure 9), or a fragment of that DNA sequence at least
20 (preferably at least 30, more preferably at least 50, and most
preferably at least 100) consecutive nucleotides in length. The DNA
hybridization probe can be labeled by any of the many different
methods known to those skilled in this art.
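
The relationship between a probe and its target, and the length limits recited above, can be sketched as follows. This is an illustration only: the function names and the toy target are hypothetical, and exact complementarity is used as a stand-in for hybridization, which in practice tolerates some mismatch depending on stringency.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def length_tier(probe: str) -> str:
    """Classify a probe by the length limits recited in the text."""
    n = len(probe)
    if n >= 100:
        return "most preferred"
    if n >= 50:
        return "more preferred"
    if n >= 30:
        return "preferred"
    if n >= 20:
        return "acceptable"
    return "too short"


def is_complementary_to(probe: str, target: str) -> bool:
    """True if the probe is exactly complementary to a stretch of the
    target (a simplification of real hybridization)."""
    return reverse_complement(probe) in target.upper()


# Hypothetical 45-nucleotide target, not taken from SEQ ID No:1.
toy_target = "ATGGGCAGCGATCGCGCTCGCAAAGGTGGCGGTGGCCCGAAAGAT"
probe = reverse_complement(toy_target[5:30])  # 25-nucleotide probe

print(length_tier(probe))
print(is_complementary_to(probe, toy_target))
```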

Antibodies to the TADG-15 protein can be used in an
immunoassay to detect increased levels of TADG-15 protein
expression in tissues suspected of neoplastic transformation. These
same uses can be achieved with Northern blot assays and analyses.

The present invention is directed to DNA encoding a
TADG-15 protein selected from the group consisting of: (a) isolated
DNA which encodes a TADG-15 protein; (b) isolated DNA which
hybridizes to isolated DNA of (a) above and which encodes a TADG-15
protein; and (c) isolated DNA differing from the isolated DNAs of (a)
and (b) above in codon sequence due to the degeneracy of the genetic
code, and which encodes a TADG-15 protein. Preferably, the DNA has
the sequence shown in SEQ ID No:1. More preferably, the DNA
encodes a TADG-15 protein having the amino acid sequence shown in
SEQ ID No:2.
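
Category (c) above rests on the degeneracy of the genetic code: two DNAs may differ in codon sequence and still encode the same protein. The following minimal sketch illustrates this with the standard genetic code. The two short DNA variants are hypothetical; their codons were chosen arbitrarily for illustration and are not taken from SEQ ID No:1.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence up to the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Two hypothetical coding sequences that differ in four of five codons.
variant_a = "ATGGGCAGCGATCGC"
variant_b = "ATGGGTTCTGACAGA"

print(translate(variant_a))
print(translate(variant_b))
print(translate(variant_a) == translate(variant_b))
```

Both variants translate to the same five residues even though their nucleotide sequences differ, which is the situation contemplated by category (c).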

WO 99/42120 PCT/US99/03436

The present invention is also directed to a
vector capable of expressing the DNA of the present invention
adapted for expression in a recombinant cell and regulatory elements
necessary for expression of the DNA in the cell. Preferably, the vector
contains DNA encoding a TADG-15 protein having the amino acid
sequence shown in SEQ ID No:2.

The present invention is also directed to a host cell
transfected with the vector described herein, said vector expressing a
TADG-15 protein. Representative host cells include
bacterial cells, mammalian cells and insect cells.

The present invention is also directed to an isolated and
purified TADG-15 protein coded for by DNA selected from the group
consisting of: (a) isolated DNA which encodes a TADG-15 protein; (b)
isolated DNA which hybridizes to isolated DNA of (a) above and which
encodes a TADG-15 protein; and (c) isolated DNA differing from the
isolated DNAs of (a) and (b) above in codon sequence due to the
degeneracy of the genetic code, and which encodes a TADG-15
protein. Preferably, the isolated and purified TADG-15 protein
has the amino acid sequence shown in SEQ ID No:2.

The present invention is also directed to a method of 
detecting expression of the protein of claim 1, comprising the steps 
of: (a) contacting mRNA obtained from the cell with the labeled 
hybridization probe; and (b) detecting hybridization of the probe 
with the mRNA.

The following examples are given for the purpose of
illustrating various embodiments of the invention and are not meant 
to limit the present invention in any fashion. 
EXAMPLE 1

Tissue collection and storage

Upon patient hysterectomy, bilateral salpingo-oophorectomy,
or surgical removal of neoplastic tissue, the specimen
was retrieved and placed on ice. The specimen was then taken to the
resident pathologist for isolation and identification of specific tissue
samples. Finally, the sample was frozen in liquid nitrogen, logged into
the laboratory record and stored at -80°C. Additional specimens were
frequently obtained from the Cooperative Human Tissue Network
(CHTN). These samples were prepared by the CHTN and shipped to us
on dry ice. Upon arrival, these specimens were logged into the
laboratory record and stored at -80°C.

EXAMPLE 2 

mRNA isolation and cDNA synthesis

Forty-one ovarian tumors (10 low malignant potential
tumors and 31 carcinomas) and 10 normal ovaries were obtained
from surgical specimens and frozen in liquid nitrogen. The human
ovarian carcinoma cell lines SW 626 and Caov 3, the human breast
carcinoma cell lines MDA-MB-231 and MDA-MB-435S, and the human
uterine cervical carcinoma cell line HeLa were purchased from the
American Type Culture Collection (Rockville, MD). Cells were cultured
to subconfluency in Dulbecco's modified Eagle's medium
supplemented with 10% (v/v) fetal bovine serum and antibiotics.

Messenger RNA (mRNA) isolation was performed
according to the manufacturer's instructions using the Mini RiboSep™
Ultra mRNA isolation kit purchased from Becton Dickinson (cat. #
30034). This was an oligo(dT) chromatography based system of
mRNA isolation. The amount of mRNA recovered was quantitated by
UV spectrophotometry.

First strand complementary DNA (cDNA) was synthesized
using 5.0 µg of mRNA and either random hexamer or oligo(dT)
primers according to the manufacturer's protocol utilizing a first
strand synthesis kit obtained from Clontech (cat. # K1402-1). The
purity of the cDNA was evaluated by PCR using primers specific for
the p53 gene. These primers span an intron such that pure cDNA can
be distinguished from cDNA that is contaminated with genomic DNA.
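
The reasoning behind intron-spanning primers is that a genomic template yields a product longer than the cDNA product by the length of the intron. The sketch below illustrates that reasoning only. The exon and intron lengths, the tolerance, and the function names are hypothetical and do not describe the actual p53 primers used.

```python
def expected_sizes(exonic_bp: int, intron_bp: int) -> dict:
    """Expected product sizes for a primer pair that spans one intron."""
    return {"cDNA": exonic_bp, "genomic": exonic_bp + intron_bp}


def interpret_bands(band_sizes_bp, sizes, tolerance_bp: int = 20) -> str:
    """Interpret observed gel bands against the expected product sizes."""
    def near(observed, expected):
        return abs(observed - expected) <= tolerance_bp

    if any(near(band, sizes["genomic"]) for band in band_sizes_bp):
        return "genomic DNA contamination"
    if any(near(band, sizes["cDNA"]) for band in band_sizes_bp):
        return "pure cDNA"
    return "no expected product"


# Hypothetical primer pair: 200 bp of exon sequence flanking a 700 bp intron.
sizes = expected_sizes(exonic_bp=200, intron_bp=700)

print(interpret_bands([205], sizes))
print(interpret_bands([198, 910], sizes))
```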

EXAMPLE 3 

PCR reactions

The mRNA overexpression of TADG-15 was determined
using quantitative PCR. Oligonucleotide primers were used for:
TADG-15, forward 5'-ATGAGAGAGGATTGAGGTAC-3' (SEQ ID NO: 10)
and reverse 5'-GAAGGTGAAGTGATTGAAGA-3' (SEQ ID NO: 11); and
β-tubulin, forward 5'-TGCATTGACAAGGAGGG-3' (SEQ ID NO: 12) and
reverse 5'-GTGTGTTGACATTGTTG-3' (SEQ ID NO: 13). β-tubulin was
utilized as an internal control. Reactions were carried out as follows:
first strand cDNA generated from 50 ng of mRNA was used as
template in the presence of 1.0 mM MgCl2, 0.2 mM dNTPs, 0.025 U
Taq polymerase/µl of reaction, and 1 x buffer supplied with the
enzyme. In addition, primers must be added to the PCR reaction.
Degenerate primers which may amplify a variety of cDNAs are used
at a final concentration of 2.0 µM each, whereas primers which
amplify specific cDNAs are added to a final concentration of 0.2 µM
each.

After initial denaturation at 95°C for 3 minutes, thirty
cycles of PCR are carried out in a Perkin Elmer Gene Amp 2400
thermal cycler. Each cycle consists of 30 seconds of denaturation at
95°C, 30 seconds of primer annealing at the appropriate annealing
temperature, and 30 seconds of extension at 72°C. The final cycle is
extended at 72°C for 7 minutes. To ensure that the reaction
succeeded, a fraction of the mixture is electrophoresed through a
2% agarose/TAE gel stained with ethidium bromide (final
concentration 1 µg/ml). The annealing temperature varies according
to the primers that are used in the PCR reaction. For the reactions
involving degenerate primers, an annealing temperature of 48°C was
used. The appropriate annealing temperature for the TADG-15 and
β-tubulin specific primers is 62°C.
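
The cycling conditions described above can be written out as a simple program, which makes the dependence on the annealing temperature explicit. This is a sketch under stated assumptions: the `Step` class and function names are hypothetical, the 7 minute extension is modeled as a separate hold following the thirtieth cycle, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int


def build_program(annealing_temp_c: float, cycles: int = 30) -> List[Step]:
    """Build the hold steps for the described PCR conditions."""
    program = [Step("initial denaturation", 95, 180)]
    for _ in range(cycles):
        program.append(Step("denaturation", 95, 30))
        program.append(Step("annealing", annealing_temp_c, 30))
        program.append(Step("extension", 72, 30))
    # Assumption: the 7 minute extension follows the last cycle.
    program.append(Step("final extension", 72, 420))
    return program


def total_minutes(program: List[Step]) -> float:
    """Total programmed hold time in minutes, excluding ramping."""
    return sum(step.seconds for step in program) / 60


degenerate = build_program(annealing_temp_c=48)  # degenerate primers
specific = build_program(annealing_temp_c=62)    # gene-specific primers

print(len(specific), total_minutes(specific))
```

Under these assumptions each program contains 92 hold steps and 55 minutes of programmed hold time; only the annealing temperature differs between the two.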

EXAMPLE 4 

T-vector ligation and transformations 

The purified PCR products are ligated into the Promega T-vector
plasmid and the ligation products are used to transform JM109
competent cells according to the manufacturer's instructions (Promega
cat. #A3610). Positive colonies were cultured for amplification, the
plasmid DNA was isolated by means of the Wizard™ Minipreps DNA
purification system (Promega cat. #A7500), and the plasmids were
digested with ApaI and SacI restriction enzymes to determine the size
of the insert. Plasmids with inserts of the size(s) visualized by the
previously described PCR product gel electrophoresis were sequenced.

EXAMPLE 5 

DNA sequencing 

Utilizing a plasmid specific primer near the cloning site,
sequencing reactions were carried out using PRISM Ready Reaction
Dye Deoxy™ terminators (Applied Biosystems cat. # 401384) according
to the manufacturer's instructions. Residual dye terminators were
removed from the completed sequencing reaction using a
Centri-Sep™ spin column (Princeton Separation cat. # CS-901). An Applied
Biosystems Model 373A DNA Sequencing System was
used for sequence analysis. Based upon the determined
sequence, primers that specifically amplify the gene of interest were
designed and synthesized.

EXAMPLE 6 

Northern blot analysis 

mRNAs (10 µg) were size separated by electrophoresis
through a 1% formaldehyde-agarose gel in 0.02 M MOPS, 0.05 M
sodium acetate (pH 7.0), and 0.001 M EDTA. The mRNAs were then
blotted to Hybond-N (Amersham) by capillary action in 20 x SSPE.
The RNAs were fixed to the membrane by baking for 2 hours at 80°C.
Additional multiple tissue northern (MTN) blots were purchased from
CLONTECH Laboratories, Inc. These blots include the Human
MTN blot (cat.#7760-1), the Human MTN II blot (cat.#7759-1), the
Human Fetal MTN II blot (cat.#7756-1), and the Human Brain MTN
III blot (cat.#7750-1). The appropriate probes were radiolabelled
utilizing the Prime-a-Gene Labeling System available from Promega
(cat.#U1100). The blots were probed and stripped according to the
ExpressHyb Hybridization Solution protocol available from CLONTECH
(cat.#8015-1 or 8015-2).

EXAMPLE 7 

Quantitative PCR

Quantitative PCR was performed in a reaction mixture
consisting of cDNA derived from 50 ng of mRNA, 5 pmol of sense and
antisense primers for TADG-15 and the internal control β-tubulin, 0.2
mmol of dNTPs, 0.5 µCi of [α-32P]dCTP, and 0.625 U of Taq
polymerase in 1 x buffer in a final volume of 25 µl. This mixture was
subjected to 1 minute of denaturation at 95°C followed by 30 cycles of
denaturation for 30 seconds at 95°C, 30 seconds of annealing at 62°C,
and 1 minute of extension at 72°C with an additional 7 minutes of
extension on the last cycle. The product was electrophoresed through
a 2% agarose gel for separation, the gel was dried under vacuum and
autoradiographed. The relative radioactivity of each band was
determined by PhosphorImager from Molecular Dynamics.
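
The amounts given for this reaction can be cross-checked against the concentrations given in Example 3. The arithmetic below is a sketch that assumes the reaction volume is in microliters and the primer concentration in micromolar; the function names are hypothetical.

```python
def primer_concentration_um(pmol: float, volume_ul: float) -> float:
    """Primer concentration in micromolar (pmol per microliter equals µM)."""
    return pmol / volume_ul


def taq_units_per_ul(units: float, volume_ul: float) -> float:
    """Taq polymerase concentration in units per microliter."""
    return units / volume_ul


REACTION_VOLUME_UL = 25  # final volume stated for the quantitative PCR

print(primer_concentration_um(5, REACTION_VOLUME_UL))   # 0.2
print(taq_units_per_ul(0.625, REACTION_VOLUME_UL))      # 0.025
```

On these assumptions, 5 pmol of each primer and 0.625 U of Taq polymerase in 25 µl correspond to the 0.2 µM specific-primer concentration and the 0.025 U/µl enzyme concentration of Example 3.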

EXAMPLE 8

The present invention describes the use of primers
directed to conserved areas of the serine protease family to
identify members of that family which are overexpressed in
carcinoma. Several genes were identified and cloned in other tissues,
but not previously associated with ovarian carcinoma. The present
invention describes a protease identified in ovarian carcinoma. This
gene was identified using primers to the conserved area surrounding
the catalytic domain of the conserved amino acid histidine and the
downstream conserved amino acid serine which lies approximately
150 amino acids towards the carboxyl end of the protease.

The gene encoding the novel extracellular serine protease
of the present invention was identified from a group of proteases
overexpressed in carcinoma by subcloning and sequencing the
appropriate PCR products. An example of such a PCR reaction is given
in Figure 1. Subcloning and sequencing of individual bands from such
an amplification provided a basis for identifying the protease of the
present invention.

EXAMPLE 9 

The sequence determined for the catalytic domain of 
TADG-15 is presented in Figure 2 and is consistent with other serine 
proteases and specifically contains conserved amino acids appropriate 
for the catalytic domain of the trypsin-like serine protease family.
Specific primers (20mers) derived from this sequence were used. 

A series of normal and tumor cDNAs were examined to
determine the expression of the TADG-15 gene in ovarian carcinoma.
In a series of normal derived cDNA compared to carcinoma derived
cDNA using β-tubulin as an internal control for PCR amplification,
TADG-15 was significantly overexpressed in all of the
carcinomas examined and either was not detected or was detected at
a very low level in normal epithelial tissue (Figure 3). This evaluation
was extended to a standard panel of about 40 tumors. Using these
specific primers, the expression of this gene was also examined in
tumor cell lines derived from both ovarian and breast carcinoma
tissues as shown in Figure 5 and in other tumor tissues as shown in
Figure 6. The expression of TADG-15 was also observed in carcinomas
of the breast, colon, prostate and lung.
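
One common way to use an internal control such as β-tubulin is to express each TADG-15 band signal as a ratio to the β-tubulin band from the same reaction and then compare tumor ratios with normal ratios. The sketch below illustrates that calculation only. The text does not state the exact computation used, and every number and function name here is hypothetical.

```python
def relative_expression(target_signal: float, control_signal: float) -> float:
    """Ratio of the target band signal to the internal control band signal."""
    if control_signal <= 0:
        raise ValueError("internal control signal must be positive")
    return target_signal / control_signal


def fold_over_normal(tumor_ratio: float, normal_ratios) -> float:
    """Tumor ratio divided by the mean ratio of the normal samples."""
    mean_normal = sum(normal_ratios) / len(normal_ratios)
    return tumor_ratio / mean_normal


# Hypothetical band intensities in arbitrary units.
tumor_ratio = relative_expression(target_signal=300, control_signal=100)
normal_ratios = [
    relative_expression(50, 100),
    relative_expression(100, 100),
]

print(tumor_ratio)
print(fold_over_normal(tumor_ratio, normal_ratios))
```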

Using the specific sequence for TADG-15 covering the full
domain of the catalytic site as a probe for Northern blot analysis,
three Northern blots were examined: one derived from ovarian
tissues, both normal and carcinoma; one from fetal tissues; and one
from adult normal tissues. As shown in Figure 7, TADG-15 transcripts
were noted in all ovarian carcinomas, but were not present in
detectable levels in any of the following tissues: a) normal ovary, b)
fetal liver and brain, c) adult spleen, thymus, testes, ovary and
peripheral blood lymphocytes, d) skeletal muscle, liver, brain or
heart. The transcript size was found to be approximately 3.2 kb. The
hybridization for the fetal and adult blots was appropriate and done
with the same probe as with the ovarian tissue. Subsequent to this
examination, it was confirmed that these blots contained other
detectable mRNA transcripts.

Initially using the catalytic domain of the protease to
probe HeLa cDNA and ovarian tumor cDNA libraries, one clone was
obtained covering the entire 3' end of the TADG-15 gene from the
ovarian tumor library. On further screening using the 5' end of the
newly detected clones, two more clones were identified covering the
5' end of the TADG-15 gene from the HeLa library (Figure 8). The
complete nucleotide sequence (SEQ ID No:1) is provided in Figure 9
along with translation of the open reading frame (SEQ ID No:2).

In the nucleotide sequence, there is a Kozak sequence
typical of sequences upstream from the initiation site of translation.
There is also a poly-adenylation signal sequence and a
poly-adenylated tail. The open reading frame consists of an 855 amino
acid sequence (SEQ ID No:2) which includes an amino terminal cytoplasmic
tail from amino acids 1-50, an approximately 22 amino acid
transmembrane domain followed by an extracellular sequence
preceding two CUB repeats identified from complement
subcomponents C1r and C1s. These two repeats are followed by four
repeat domains of a class A motif of the LDL receptor and these four
repeats are followed by the protease enzyme of the trypsin family
constituting the carboxyl end of the TADG-15 protein (Figure 11).
Also, a clear delineation of the catalytic domain conserved histidine,
aspartic acid, serine series along with a series of amino acids
conserved in the serine protease family is indicated (Figure 10).

A search of GenBank for similar previously identified
sequences yielded one such sequence with relatively high homology
to a portion of the TADG-15 gene. The similarity between the portion
of TADG-15 from nucleotide #182 to 3139 and SNC-19 (SEQ ID No: 9;
GenBank accession #U20428) is approximately 97% (Figure 12).
There are however significant differences between SNC-19 and
TADG-15, viz. TADG-15 has an open reading frame of 855 amino acids
whereas the longest ORF of SNC-19 is only 173 amino acids. SNC-19
does not include a proper start site for the initiation of translation nor
does it include the amino terminal portion of the protein encoded by
TADG-15. Moreover, SNC-19 does not include an ORF for a
functional serine protease because the His, Asp and Ser residues
necessary for function are encoded in different reading frames.
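
The comparison of open reading frame lengths above can be illustrated with a simple ORF scan. The sketch below finds the longest ATG-to-stop ORF in the three forward reading frames and shows how a single inserted base shifts the frame and shortens it. The function name and both toy sequences are hypothetical and are not derived from TADG-15 or SNC-19; ORFs lacking a terminating stop codon are ignored.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(dna: str):
    """Return (frame, start_index, length_in_codons) of the longest
    ATG-to-stop ORF in the three forward frames. The length excludes
    the stop codon."""
    dna = dna.upper()
    best = (0, -1, 0)
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                length = (i - start) // 3
                if length > best[2]:
                    best = (frame, start, length)
                start = None
    return best


in_frame = "ATGGCTGATTCTTAA"    # ATG GCT GAT TCT TAA
shifted = "ATGGGCTGATTCTTAA"    # one extra G inserted after the ATG

print(longest_orf(in_frame))
print(longest_orf(shifted))
```

In the toy example the inserted base brings a TGA stop codon into frame, so the ORF drops from four codons to two.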

TADG-15 is a highly overexpressed gene in tumors. It is
expressed in a limited number of normal tissues, primarily tissues
that are involved in either uptake or secretion of molecules, e.g., colon
and pancreas. TADG-15 is further novel in its component structure of
domains in that it has a protease catalytic domain which could be
released and used as a diagnostic and which has the potential for a
target for therapeutic intervention. TADG-15 also has ligand binding
domains which are commonly associated with molecules that
internalize or take up ligands from the external surface of the cell as
does the LDL receptor for the LDL cholesterol complex. There is
potential that these domains may be involved in uptake of specific
ligands and they may offer the potential for making delivery of toxic
molecules or genes to tumor cells which express this molecule on their
surface. It has features that are similar to the hepsin serine protease
molecule in that it also has an amino-terminal transmembrane
domain with the proteolytic catalytic domain extended into the
extracellular matrix. The difference here is that TADG-15 includes
these ligand binding repeat domains which the hepsin gene does not
have. In addition to the use of this gene as a diagnostic or therapeutic
target in ovarian carcinoma and other carcinomas including breast,
prostate, lung and colon, its ligand-binding domains may be valuable
in the uptake of specific molecules into tumor cells.

Any patents or publications mentioned in this
specification are indicative of the levels of those skilled in the art to
which the invention pertains. These patents and publications are
herein incorporated by reference to the same extent as
if each individual publication was specifically and individually
indicated to be incorporated by reference.

One skilled in the art will readily appreciate that the
present invention is well adapted to carry out the objects and obtain
the ends and advantages mentioned, as well as those inherent therein.
The present examples along with the methods, procedures,
treatments, molecules, and specific compounds described herein are
presently representative of preferred embodiments, are exemplary,
and are not intended as limitations on the scope of the invention.
Changes therein and other uses will occur to those skilled in the art
which are encompassed within the spirit of the invention as defined
by the scope of the claims.
WHAT IS CLAIMED IS:

1. DNA encoding a TADG-15 protein selected from the
group consisting of:
(a) isolated DNA which encodes a TADG-15 protein;
(b) isolated DNA which hybridizes to isolated DNA of (a)
above and which encodes a TADG-15 protein; and
(c) isolated DNA differing from the isolated DNAs of (a)
and (b) above in codon sequence due to the degeneracy of the genetic
code, and which encodes a TADG-15 protein.

2. The DNA of claim 1, wherein said DNA has the
sequence shown in SEQ ID No:1.

3. The DNA of claim 1, wherein said TADG-15 protein
has the amino acid sequence shown in SEQ ID No:2.

4. A vector capable of expressing the DNA of claim
1 adapted for expression in a recombinant cell and regulatory
elements necessary for expression of the DNA in the cell.

5. The vector of claim 4, wherein said DNA encodes a
TADG-15 protein having the amino acid sequence shown in SEQ ID
No:2.

6. A host cell transfected with the vector of claim 4,
said vector expressing a TADG-15 protein.

7. The host cell of claim 6, wherein said cell is selected
from the group consisting of bacterial cells, mammalian cells, plant cells
and insect cells.

8. The host cell of claim 7, wherein said bacterial cell is
E. coli.

9. Isolated and purified TADG-15 protein coded for by
DNA selected from the group consisting of:
(a) isolated DNA which encodes a TADG-15 protein;
(b) isolated DNA which hybridizes to isolated DNA of (a)
above and which encodes a TADG-15 protein; and
(c) isolated DNA differing from the isolated DNAs of (a)
and (b) above in codon sequence due to the degeneracy of the genetic
code, and which encodes a TADG-15 protein.

10. The isolated and purified TADG-15 protein of claim
9 having the amino acid sequence shown in SEQ ID No:2.

11. A method of detecting expression of the protein of
claim 1, comprising the steps of:
(a) contacting mRNA obtained from the cell with the
labeled hybridization probe; and
(b) detecting hybridization of the probe with the mRNA.
[FIG. 10: Amino acid sequence of TADG-15 (SEQ ID NO: 2), shown in rows
of 50 residues numbered 1 to 851 and annotated with the features and
domains listed below.]

Annotated features:
Conserved cysteine residue
NXT: Possible N-linked glycosylation site
SDE: Conserved SDE motif
Potential cleavage site
Conserved amino acids of catalytic triad H, D, S

Domains:
1. Cytoplasmic domain
2. Transmembrane domain
3. CUB repeat
4. Ligand-binding repeat (class A motif) of LDL receptor like domain
5. Serine protease

FIG. 10
[FIG. 12: Nucleotide sequence alignment of TADG-15 with SNC-19, shown
together with the GenBank record for SNC-19 summarized below.]

LOCUS: HSU20428, 2900 bp
DEFINITION: Human SNC19 mRNA sequence
ACCESSION: U20428
NID: g1890631
SOURCE: human (Homo sapiens)
REFERENCE 1: Zheng, S., Cai, X., et al., SNC19 gene in Homo sapiens,
Unpublished
REFERENCE 2 (bases 1 to 2900): Zheng, S., Direct Submission, submitted
1995 by Shu Zheng

237, AG.:.>.CKCTGGC-GIAT?.GGC-GCCG(KC^C^CCCA.»ATG.G...C^^ ,1111111111111111111111 

III 1 1 1 1 1 I I H m I II .. MI 1 . 1 1 ''I'll iL^r-r-lr-lr^a-^^^^^^^^ "'^ 
2313 AG>sCACTC<3GGTATAGGGGCCGGGGCC^.CCC?J^ATG^GiAC^.CuJG^.Gv.C--■<.w■^^C._-..CG.CC>.^ 

2671C^.CCGCTG>.=TGC.rC.^CGCCCCC..G;^.CAT..C..CTGT^^^ 

'473 GCAC=GTG.^C-GCACC^QCG.CCCC?.GA.:.CATACACTG7G»«CiC.AiC>CCAGG..C.C«~..CT5.i..^ 

rr-, -rr, rM-^.«iJiJiGG7TTaViai.C?.tVC<C-CCCCCGCCA6CCCC?AGC 23S8 

2771 AGTGGA.GC7GGa^.GGTAG?AGGGS.^GG..:.CVCTGG.GGj|CT..O I ,,1,11111 I 

11111. uiii.i Miiiiuiniii 1 1 Ml II II m MM 11 1, n ciiGGUrkUc-.c^^c^:-. • • -gtccggcagccc. .jag 26S3 

nSS AGTGa=.GCT3--anGGG-AG?.AC-GGGAGGAA=;iCTGGiGG. iv..«C7G«CCC-^CiGG«... . O-CC. . — 

- — r; r-(-Sit-G(li'--C^CGG?.3=CTCC7C?.:T-S.:.V2GTG3TGGGGCTGCCGG 2s»5 

2849 TGGGCCaWGCGCGT7TGTGTA7A7CTGCC7CCCCTCTC.G..^-.>.>p^^^ ,,,,, ,,,imi|ll 

nil. 1 lllimill HI II' H I ^il J TC7C7iG72.-. G7GTE\CTGCCGG 273s 

2fi59 TGG3C^AGS;.CGCGTrrGTGCf.TA.CTGCC.CTGCTC.Ai.■^a«:G«A«»•.v.>..-.v« ^ 

2969ATCTGGGC7G7GGGGCC=77GGGCC.rGC7C7T^.GG..GCCCA.GGC7C^ 

273. ililii . A^l^l:^:^':^^^ 

"=tr=?i^;?:f ;;;; ^Sio u,;.^ 

AGG, .7G^.C77C-.GTG7GTG7A.77G7G7-r>^4G.-^G*A^----.C.i i i i .Ci . i i 



{ I t 

2833 CCC 
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SEQUENCE LISTING 

<110> O'Brien, Timothy J.
Tanimoto, Hirotoshi
<120> TADG-15: An Extracellular Serine Protease
Overexpressed in Breast and Ovarian Carcinomas
<130> D6064PCT
<140> PCT/US99/03436
<141> 1999-02-18
<150> US 09/027,337
<151> 1998-02-20
<160> 14
<210> 1
<211> 3147
<212> DNA
<213> Homo sapiens
<220>
<223> cDNA sequence of TADG-15
<400> 1

tcaagagcgg cctcggggta ccatggggag cgatcgggcc cgcaagggcg 50
gagggggccc gaaggacttc ggcgcgggac tcaagtacaa ctcccggcac 100
gagaaagtga atggcttgga ggaaggcgtg gagttcctgc cagtcaacaa 150
cgtcaagaag gtggaaaagc atggcccggg gcgctgggtg gtgctggcag 200
ccgtgctgat cggcctcctc ttggtcttgc tggggatcgg cttcctggtg 250
tggcatttgc agtaccggga cgtgcgtgtc cagaaggtct tcaatggcta 300
catgaggatc acaaatgaga attttgtgga tgcctacgag aactccaact 350
ccactgagtt tgtaagcctg gccagcaagg tgaaggacgc gctgaagctg 400
ctgtacagcg gagtcccatt cctgggcccc taccacaagg agtcggctgt 450
gacggccttc agcgagggca gcgtcatcgc ctactactgg tctgagttca 500
gcatcccgca gcacctggtg gaggaggccg agcgcgtcat ggccgaggag 550
cgcgtagtca tgctgccccc gcgggcgcgc tccctgaagt cctttgtggt 600
cacctcagtg gtggctttcc ccacggactc caaaacagta cagaggaccc 650
aggacaacag ctgcagcttt ggcctgcacg cccgcggtgt ggagctgatg 700
cgcttcacca cgcccggctt ccctgacagc ccctaccccg ctcatgcccg 750
ctgccagtgg gccctgcggg gggacgccga ctcagtgctg agcctcacct 800
tccgcagctt tgaccttgcg tcctgcgacg agcgcggcag cgacctggtg 850
acggtgtaca acaccctgag ccccatggag ccccacgccc tggtgcagtt 900
gtgtggcacc taccctccct cctacaacct gaccttccac tcctcccaga 950
acgtcctgct catcacactg ataaccaaca ctgagcggcg gcatcccggc 1000

SEQ 1/17 

tttgaggcca ccttcttcca gctgcctagg atgagcagct gtggaggccg 1050
cttacgtaaa gcccagggga cattcaacag cccctactac ccaggccact 1100
acccacccaa cattgactgc acatggaaca ttgaggtgcc caacaaccag 1150
catgtgaagg tgagcttcaa attcttctac ctgctggagc ccggcgtgcc 1200
tgcgggcacc tgccccaagg actacgtgga gatcaatggg gagaaatact 1250
gcggagagag gtcccagttc gtcgtcacca gcaacagcaa caagatcaca 1300
gttcgcttcc actcagatca gtcctacacc gacaccggct tcttagctga 1350
atacctctcc tacgactcca gtgacccatg cccggggcag ttcacgtgcc 1400
gcacggggcg gtgtatccgg aaggagctgc gctgtgatgg ctgggccgac 1450
tgcaccgacc acagcgatga gctcaactgc agttgcgacg ccggccacca 1500
gttcacgtgc aagaacaagt tctgcaagcc cctcttctgg gtctgcgaca 1550
gtgtgaacga ctgcggagac aacagcgacg agcaggggtg cagttgtccg 1600
gcccagacct tcaggtgttc caatgggaag tgcctctcga aaagccagca 1650
gtgcaatggg aaggacgact gtggggacgg gtccgacgag gcctcctgcc 1700
ccaaggtgaa cgtcgtcact tgtaccaaac acacctaccg ctgcctcaat 1750
gggctctgct tgagcaaggg caaccctgag tgtgacggga aggaggactg 1800
cagcgacggc tcagatgaga aggactgcga ctgtgggctg cggtcattca 1850
cgagacaggc tcgtgttgtt gggggcacgg atgcggatga gggcgagtgg 1900
ccctggcagg taagcctgca tgctctgggc cagggccaca tctgcggtgc 1950
ttccctcatc tctcccaact ggctggtctc tgccgcacac tgctacatcg 2000
atgacagagg attcaggtac tcagacccca cgcagtggac ggccttcctg 2050
ggcttgcacg accagagcca gcgcagcgcc cctggggtgc aggagcgcag 2100
gctcaagcgc atcatctccc accccttctt caatgacttc accttcgact 2150
atgacatcgc gctgctggag ctggagaaac cggcagagta cagctccatg 2200
gtgcggccca tctgcctgcc ggacgcctcc catgtcttcc ctgccggcaa 2250
ggccatctgg gtcacgggct ggggacacac ccagtatgga ggcactggcg 2300
cgctgatcct gcaaaagggt gagatccgcg tcatcaacca gaccacctgc 2350
gagaacctcc tgccgcagca gatcacgccg cgcatgatgt gcgtgggctt 2400
cctcagcggc ggcgtggact cctgccaggg tgattccggg ggacccctgt 2450
ccagcgtgga ggcggatggg cggatcttcc aggccggtgt ggtgagctgg 2500
ggagacggct gcgctcagag gaacaagcca ggcgtgtaca caaggctccc 2550
tctgtttcgg gactggatca aagagaacac tggggtatag gggccggggc 2600
cacccaaatg tgtacacctg cggggccacc catcgtccac cccagtgtgc 2650
acgcctgcag gctggagact ggaccgctga ctgcaccagc gcccccagaa 2700
catacactgt gaactcaatc tccagggctc caaatctgcc tagaaaacct 2750
ctcgcttcct cagcctccaa agtggagctg ggaggtagaa ggggaggaca 2800
ctggtggttc tactgaccca actgggggca aaggtttgaa gacacagcct 2850
cccccgccag ccccaagctg ggccgaggcg cgtttgtgta tatctgcctc 2900
ccctgtctgt aaggagcagc gggaacggag cttcggagcc tcctcagtga 2950
aggtggtggg gctgccggac ctgggctgtg gggcccttgg gccacgctct 3000
tgaggaagcc caggctcgga ggaccctgga aaacagacgg gtctgagact 3050
gaaattgttt taccagctcc cagggtggac ttcagtgtgt gtatttgtgt 3100
aaatgggtaa aacaatttat ctctttttaa aaaaaaaaaa aaaaaaa 3147

<210> 2
<211> 855
<212> PRT
<213> Homo sapiens
<220>
<223> Amino acid sequence of TADG-15 encoded by cDNA
<400> 2

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp
5 10 15
Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn
20 25 30
Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn Val Lys
35 40 45
Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala
50 55 60
Val Leu Ile Gly Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu
65 70 75
Val Trp His Leu Gln Tyr Arg Asp Val Arg Val Gln Lys Val Phe
80 85 90
Asn Gly Tyr Met Arg Ile Thr Asn Glu Asn Phe Val Asp Ala Tyr
95 100 105
Glu Asn Ser Asn Ser Thr Glu Phe Val Ser Leu Ala Ser Lys Val
110 115 120
Lys Asp Ala Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly
125 130 135
Pro Tyr His Lys Glu Ser Ala Val Thr Ala Phe Ser Glu Gly Ser
140 145 150
Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser Ile Pro Gln His Leu
155 160 165
Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu Arg Val Val Met
170 175 180
Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val Val Thr Ser
185 190 195
Val Val Ala Phe Pro Thr Asp Ser Lys Thr Val Gln Arg Thr Gln
200 205 210
Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu Leu
215 220 225
Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala
230 235 240
His Ala Arg Cys Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val
245 250 255
Leu Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu
260 265 270
Arg Gly Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met
275 280 285
Glu Pro His Ala Leu Val Gln Leu Cys Gly Thr Tyr Pro Pro Ser
290 295 300
Tyr Asn Leu Thr Phe His Ser Ser Gln Asn Val Leu Leu Ile Thr
305 310 315
Leu Ile Thr Asn Thr Glu Arg Arg His Pro Gly Phe Glu Ala Thr
320 325 330
Phe Phe Gln Leu Pro Arg Met Ser Ser Cys Gly Gly Arg Leu Arg
335 340 345
Lys Ala Gln Gly Thr Phe Asn Ser Pro Tyr Tyr Pro Gly His Tyr
350 355 360
Pro Pro Asn Ile Asp Cys Thr Trp Asn Ile Glu Val Pro Asn Asn
365 370 375
Gln His Val Lys Val Ser Phe Lys Phe Phe Tyr Leu Leu Glu Pro
380 385 390
Gly Val Pro Ala Gly Thr Cys Pro Lys Asp Tyr Val Glu Ile Asn
395 400 405
Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gln Phe Val Val Thr Ser
410 415 420
Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp Gln Ser Tyr
425 430 435
Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp Ser Ser
440 445 450
Asp Pro Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys Ile
455 460 465
Arg Lys Glu Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His
470 475 480
Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala Gly His Gln Phe Thr
485 490 495
Cys Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp Val Cys Asp Ser
500 505 510
Val Asn Asp Cys Gly Asp Asn Ser Asp Glu Gln Gly Cys Ser Cys
515 520 525
Pro Ala Gln Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys
530 535 540
Ser Gln Gln Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp
545 550 555
Glu Ala Ser Cys Pro Lys Val Asn Val Val Thr Cys Thr Lys His
560 565 570
Thr Tyr Arg Cys Leu Asn Gly Leu Cys Leu Ser Lys Gly Asn Pro
575 580 585
Glu Cys Asp Gly Lys Glu Asp Cys Ser Asp Gly Ser Asp Glu Lys
590 595 600
Asp Cys Asp Cys Gly Leu Arg Ser Phe Thr Arg Gln Ala Arg Val
605 610 615
Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gln Val
620 625 630
Ser Leu His Ala Leu Gly Gln Gly His Ile Cys Gly Ala Ser Leu
635 640 645
Ile Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr Ile Asp
650 655 660
Asp Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gln Trp Thr Ala Phe
665 670 675
Leu Gly Leu His Asp Gln Ser Gln Arg Ser Ala Pro Gly Val Gln
680 685 690
Glu Arg Arg Leu Lys Arg Ile Ile Ser His Pro Phe Phe Asn Asp
695 700 705
Phe Thr Phe Asp Tyr Asp Ile Ala Leu Leu Glu Leu Glu Lys Pro
710 715 720
Ala Glu Tyr Ser Ser Met Val Arg Pro Ile Cys Leu Pro Asp Ala
725 730 735
Ser His Val Phe Pro Ala Gly Lys Ala Ile Trp Val Thr Gly Trp
740 745 750
Gly His Thr Gln Tyr Gly Gly Thr Gly Ala Leu Ile Leu Gln Lys
755 760 765
Gly Glu Ile Arg Val Ile Asn Gln Thr Thr Cys Glu Asn Leu Leu
770 775 780
Pro Gln Gln Ile Thr Pro Arg Met Met Cys Val Gly Phe Leu Ser
785 790 795
Gly Gly Val Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Ser
800 805 810
Ser Val Glu Ala Asp Gly Arg Ile Phe Gln Ala Gly Val Val Ser
815 820 825
Trp Gly Asp Gly Cys Ala Gln Arg Asn Lys Pro Gly Val Tyr Thr
830 835 840
Arg Leu Pro Leu Phe Arg Asp Trp Ile Lys Glu Asn Thr Gly Val
845 850 855

<210> 3
<211> 256
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine protease catalytic domain of hepsin (Heps)
homologous to similar domain in TADG-15
<400> 3

Arg Ile Val Gly Gly Arg Asp Thr Ser Leu Gly Arg Trp Pro Trp
5 10 15
Gln Val Ser Leu Arg Tyr Asp Gly Ala His Leu Cys Gly Gly Ser
20 25 30
Leu Leu Ser Gly Asp Trp Val Leu Thr Ala Ala His Cys Phe Pro
35 40 45
Glu Arg Asn Arg Val Leu Ser Arg Trp Arg Val Phe Ala Gly Ala
50 55 60
Val Ala Gln Ala Ser Pro His Gly Leu Gln Leu Gly Val Gln Ala
65 70 75
Val Val Tyr His Gly Gly Tyr Leu Pro Phe Arg Asp Pro Asn Ser
80 85 90
Glu Glu Asn Ser Asn Asp Ile Ala Leu Val His Leu Ser Ser Pro
95 100 105
Leu Pro Leu Thr Glu Tyr Ile Gln Pro Val Cys Leu Pro Ala Ala
110 115 120
Gly Gln Ala Leu Val Asp Gly Lys Ile Cys Thr Val Thr Gly Trp
125 130 135
Gly Asn Thr Gln Tyr Tyr Gly Gln Gln Ala Gly Val Leu Gln Glu
140 145 150
Ala Arg Val Pro Ile Ile Ser Asn Asp Val Cys Asn Gly Ala Asp
155 160 165
Phe Tyr Gly Asn Gln Ile Lys Pro Lys Met Phe Cys Ala Gly Tyr
170 175 180
Pro Glu Gly Gly Ile Asp Ala Cys Gln Gly Asp Ser Gly Gly Pro
185 190 195
Phe Val Cys Glu Asp Ser Ile Ser Arg Thr Pro Arg Trp Arg Leu
200 205 210
Cys Gly Ile Val Ser Trp Gly Thr Gly Cys Ala Leu Ala Gln Lys
215 220 225
Pro Gly Val Tyr Thr Lys Val Ser Asp Phe Arg Glu Trp Ile Phe
230 235 240
Gln Ala Ile Lys Thr His Ser Glu Ala Ser Gly Met Val Thr Gln
245 250 255
Leu

<210> 4
<211> 225
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine protease catalytic domain of Scce
homologous to similar domain in TADG-15
<400> 4

Lys Ile Ile Asp Gly Ala Pro Cys Ala Arg Gly Ser His Pro Trp
5 10 15
Gln Val Ala Leu Leu Ser Gly Asn Gln Leu His Cys Gly Gly Val
20 25 30
Leu Val Asn Glu Arg Trp Val Leu Thr Ala Ala His Cys Lys Met
35 40 45
Asn Glu Tyr Thr Val His Leu Gly Ser Asp Thr Leu Gly Asp Arg
50 55 60
Arg Ala Gln Arg Ile Lys Ala Ser Lys Ser Phe Arg His Pro Gly
65 70 75
Tyr Ser Thr Gln Thr His Val Asn Asp Leu Met Leu Val Lys Leu
80 85 90
Asn Ser Gln Ala Arg Leu Ser Ser Met Val Lys Lys Val Arg Leu
95 100 105
Pro Ser Arg Cys Glu Pro Pro Gly Thr Thr Cys Thr Val Ser Gly
110 115 120
Trp Gly Thr Thr Thr Ser Pro Asp Val Thr Phe Pro Ser Asp Leu
125 130 135
Met Cys Val Asp Val Lys Leu Ile Ser Pro Gln Asp Cys Thr Lys
140 145 150
Val Tyr Lys Asp Leu Leu Glu Asn Ser Met Leu Cys Ala Gly Ile
155 160 165
Pro Asp Ser Lys Lys Asn Ala Cys Asn Gly Asp Ser Gly Gly Pro
170 175 180
Leu Val Cys Arg Gly Thr Leu Gln Gly Leu Val Ser Trp Gly Thr
185 190 195
Phe Pro Cys Gly Gln Pro Asn Asp Pro Gly Val Tyr Thr Gln Val
200 205 210
Cys Lys Phe Thr Lys Trp Ile Asn Asp Thr Met Lys Lys His Arg
215 220 225


<210> 5
<211> 225
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine [...] (Try) homologous to similar domain in TADG-15
<400> 5

Lys Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr
5 10 15
Gln Val Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu
20 25 30
Ile Asn Glu Gln Trp Val Val Ser Ala Gly His Cys Tyr Lys Ser
35 40 45
Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu
50 55 60
Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65 70 75
Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys
80 85 90
Leu Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser
95 100 105
Leu Pro Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser
110 115 120
Gly Trp Gly Asn Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp Glu
125 130 135
Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
140 145 150
Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly
155 160 165
Phe Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly
170 175 180
Pro Val Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly
185 190 195
Asp Gly Cys Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val
200 205 210
Tyr Asn Tyr Val Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
215 220 225


<210> 6
<211> 231
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine [...] (Chymb)
<400> 6



Arg Ile Val Asn Gly Glu Asp Ala Val Pro Gly Ser Trp Pro Trp
5 10 15
Gln Val Ser Leu Gln Asp Lys Thr Gly Phe His Phe Cys Gly Gly
20 25 30
Ser Leu Ile Ser Glu Asp Trp Val Val Thr Ala Ala His Cys Gly
35 40 45
Val Arg Thr Ser Asp Val Val Val Ala Gly Glu Phe Asp Gln Gly
50 55 60
Ser Asp Glu Glu Asn Ile Gln Val Leu Lys Ile Ala Lys Val Phe
65 70 75
Lys Asn Pro Lys Phe Ser Ile Leu Thr Val Asn Asn Asp Ile Thr
80 85 90
Leu Leu Lys Leu Ala Thr Pro Ala Arg Phe Ser Gln Thr Val Ser
95 100 105
Ala Val Cys Leu Pro Ser Ala Asp Asp Asp Phe Pro Ala Gly Thr
110 115 120
Leu Cys Ala Thr Thr Gly Trp Gly Lys Thr Lys Tyr Asn Ala Asn
125 130 135
Lys Thr Pro Asp Lys Leu Gln Gln Ala Ala Leu Pro Leu Leu Ser
140 145 150
Asn Ala Glu Cys Lys Lys Ser Trp Gly Arg Arg Ile Thr Asp Val
155 160 165
Met Ile Cys Ala Gly Ala Ser Gly Val Ser Ser Cys Met Gly Asp
170 175 180
Ser Gly Gly Pro Leu Val Cys Gln Lys Asp Gly Ala Trp Thr Leu
185 190 195
Val Gly Ile Val Ser Trp Gly Ser Asp Thr Cys Ser Thr Ser Ser
200 205 210
Pro Gly Val Tyr Ala Arg Val Thr Lys Leu Ile Pro Trp Val Gln
215 220 225
Lys Ile Leu Ala Ala Asn
230


<210> 7
<211> 255
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine [...] (Fac7) homologous to similar domain in TADG-15
<400> 7

Arg Ile Val Gly Gly Lys Val Cys Pro Lys Gly Glu Cys Pro Trp
5 10 15
Gln Val Leu Leu Leu Val Asn Gly Ala Gln Leu Cys Gly Gly Thr
20 25 30
Leu Ile Asn Thr Ile Trp Val Val Ser Ala Ala His Cys Phe Asp
35 40 45
Lys Ile Lys Asn Trp Arg Asn Leu Ile Ala Val Leu Gly Glu His
50 55 60
Asp Leu Ser Glu His Asp Gly Asp Glu Gln Ser Arg Arg Val Ala
65 70 75
Gln Val Ile Ile Pro Ser Thr Tyr Val Pro Gly Thr Thr Asn His
80 85 90
Asp Ile Ala Leu Leu Arg Leu His Gln Pro Val Val Leu Thr Asp
95 100 105
His Val Val Pro Leu Cys Leu Pro Glu Arg Thr Phe Ser Glu Arg
110 115 120
Thr Leu Ala Phe Val Arg Phe Ser Leu Val Ser Gly Trp Gly Gln
125 130 135
Leu Leu Asp Arg Gly Ala Thr Ala Leu Glu Leu Met Val Leu Asn
140 145 150
Val Pro Arg Leu Met Thr Gln Asp Cys Leu Gln Gln Ser Arg Lys
155 160 165
Val Gly Asp Ser Pro Asn Ile Thr Glu Tyr Met Phe Cys Ala Gly
170 175 180
Tyr Ser Asp Gly Ser Lys Asp Ser Cys Lys Gly Asp Ser Gly Gly
185 190 195
Pro His Ala Thr His Tyr Arg Gly Thr Trp Tyr Leu Thr Gly Ile
200 205 210
Val Ser Trp Gly Gln Gly Cys Ala Thr Val Gly His Phe Gly Val
215 220 225
Tyr Thr Arg Val Ser Gln Tyr Ile Glu Trp Leu Gln Lys Leu Met
230 235 240
Arg Ser Glu Pro Arg Pro Gly Val Leu Leu Arg Ala Pro Phe Pro
245 250 255

<210> 8
<211> 253
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine protease catalytic domain of tissue
plasminogen activator (Tpa) homologous to
similar domain in TADG-15.
<400> 8

Arg Ile Lys Gly Gly Leu Phe Ala Asp Ile Ala Ser His Pro Trp
5 10 15
Gln Ala Ala Ile Phe Ala Lys His Arg Arg Ser Pro Gly Glu Arg
20 25 30
Phe Leu Cys Gly Gly Ile Leu Ile Ser Ser Cys Trp Ile Leu Ser
35 40 45
Ala Ala His Cys Phe Gln Glu Arg Phe Pro Pro His His Leu Thr
50 55 60
Val Ile Leu Gly Arg Thr Tyr Arg Val Val Pro Gly Glu Glu Glu
65 70 75
Gln Lys Phe Glu Val Glu Lys Tyr Ile Val His Lys Glu Phe Asp
80 85 90
Asp Asp Thr Tyr Asp Asn Asp Ile Ala Leu Leu Gln Leu Lys Ser
95 100 105
Asp Ser Ser Arg Cys Ala Gln Glu Ser Ser Val Val Arg Thr Val
110 115 120
Cys Leu Pro Pro Ala Asp Leu Gln Leu Pro Asp Trp Thr Glu Cys
125 130 135
Glu Leu Ser Gly Tyr Gly Lys His Glu Ala Leu Ser Pro Phe Tyr
140 145 150
Ser Glu Arg Leu Lys Glu Ala His Val Arg Leu Tyr Pro Ser Ser
155 160 165
Arg Cys Thr Ser Gln His Leu Leu Asn Arg Thr Val Thr Asp Asn
170 175 180
Met Leu Cys Ala Gly Asp Thr Arg Ser Gly Gly Pro Gln Ala Asn
185 190 195
Leu His Asp Ala Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys
200 205 210
Leu Asn Asp Gly Arg Met Thr Leu Val Gly Ile Ile Ser Trp Gly
215 220 225
Leu Gly Cys Gly Gln Lys Asp Val Pro Gly Val Tyr Thr Lys Val
230 235 240
Thr Asn Tyr Leu Asp Trp Ile Arg Asp Asn Met Arg Pro
245 250

<210> 9
<211> 2900
<212> DNA
<213> Homo sapiens
<220>
<223> SNC19 mRNA sequence (U20428)
<400> 9

cgctgggtgg tgctggcagc cgtgccgatc ggcctcctct tggtcttgct 50
ggggatcggc ttcctggtgt ggcatttgca gtaccgggac gtgcgtgtcc 100
agaaggtctt caatggctac atgaggatca caaatgagaa ttttgtggat 150
gcctacgaga actccaactc cactgagttt gtaagcctgg ccagcaaggt 200
gaaggacgcg ctgaagctgc tgtacagcgg agtcccattc ctgggcccct 250
accacaagga gtcggctgtg acggccttca gcgagggcag cgtcatcgcc 300
tactactggt ctgagttcag catcccgcag cacctggttg aggaggccga 350
gcgcgtcatg gccaggagcg cgtagtcatg ctgcccccgc gggcgcgctc 400
cctgaagtcc tttgtggtca cctcagtggt ggctttcccc acggactcca 450
aaacagtaca gaggacccag gacaacagct gcagctttgg cctgcacgcc 500
gcggtgtgga gctgatgcgc ttcaccacgc cggcttccct gacagcccct 550
accccgctca tgcccgctgc cagtgggctg cggggacgcg acgcagtgct 600
gagctactcg agctgactcg cagcttgact gcgcctcgac gagcgcggca 650
gcgacctggt gacgtgtaca acaccctgag ccccatggag ccccacgcct 700
ggtgagtgtg tggcacctac cctccctcct acaacctgac cttccactcc 750
ctcccacgaa cgtcctgctc atcacactga taaccaacac tgacgcggca 800
tcccggcttt gaggccacct tcttccagct gcctaggatg agcagctgtg 850
gaggccgctt acgtaaagcc caggggacat tcaacagccc ctactaccca 900
ggccactacc cacccaacat tgactgcaca tggaaaattg aggtgcccaa 950
caaccagcat gtgaaggtgc gcttcaaatt cttctacctg ctggagcccg 1000
gcgtgcctgc gggcacctgc cccaaggact acgtggagat caatggggag 1050
aaatactgcg gagagaggtc ccagttcgtc gtcaccagca acagcaacaa 1100
gatcacagtt cgcttccact cagatcagtc ctacaccgac accggcttct 1150
tagctgaata cctctcctac gactccagtg acccatgccc ggggcagttc 1200
acgtgccgca cggggcggtg tatccggaag gagctgcgct gtgatggctg 1250
ggcgactgca ccgaccacag cgatgagctc aactgcagtt gcgacgccgg 1300
ccaccagttc acgtgcaaga gcaagttctg caagctcttc tgggtctgcg 1350
acagtgtgaa cgagtgcgga gacaacagcg acgagcaggg ttgcatttgt 1400
ccggacccag accttcaggt gttccaatgg gaagtgcctc tcgaaaagcc 1450
agcagtgcaa tgggaaggac gactgtgggg acgggtccga cgaggcctcc 1500
tgccccaagg tgaacgtcgt cacttgtacc aaacacacct accgctgcct 1550
caatgggctc tgcttgagca agggcaaccc tgagtgtgac gggaaggagg 1600
actgtagcga cggctcagat gagaaggact gcgactgtgg gctgcggtca 1650
ttcacgagac aggctcgtgt tgttgggggc acggatgcgg atgagggcga 1700
gtggccctgg caggtaagcc tgcatgctct gggccagggc cacatctgcg 1750
gtgcttccct catctctccc aactggctgg tctctgccgc acactgctac 1800
atcgatgaca gaggattcag gtactcagac cccacgcagg acggccttcc 1850
tgggcttgca cgaccagagc cagcgcaggc cctggggtgc aggagcgcag 1900
gctcaagcgc atcatctccc accccttctt caatgacttc accttcgact 1950
atgacatcgc gctgctggag ctggagaaac cggcagagta cagctccatg 2000
gtgcggccca tctgcctgcc ggacgcctgc catgtcttcc ctgccggcaa 2050
ggccatctgg gtcacgggct ggggacacac ccagtatgga ggcactggcg 2100
cgctgatcct gcaaaagggt gagatccgcg tcatcaacca gaccacctgc 2150
gagaacctcc tgccgcagca gatcacgccg cgcatgatgt gcgtgggctt 2200
cctcagcggc ggcgtggact cctgccaggg tgattccggg ggacccctgt 2250
ccagcgtgga ggcggatggg cggatcttcc aggccggtgt ggtgagctgg 2300
ggagacgctg cgctcagagg aacaagccag gcgtgtacac aaggctccct 2350
ctgtttcggg aatggatcaa agagaacact ggggtatagg ggccggggcc 2400
acccaaatgt gtacacctgc ggggccaccc atcgtccacc ccagtgtgca 2450
cgcctgcagg ctggagactc gcgcaccgtg acctgcacca gcgcGCcaga 2500
acatacactg tgaactcatc tccaggctca aatctgctag aaaacctctc 2550
gcttcctcag cctccaaagt ggagctggga gggtagaagg ggaggaacac 2600
tggtggttct actgacccaa ctggggcaag gtttgaagca cagctccggc 2650
agcccaagtg ggcgaggacg cgtttgtgca tactgccctg ctctatacac 2700
ggaagacctg gatctctagt gagtgtgact gccggatctg gctgtggtcc 2750
ttggccacgc ttcttgagga agcccaggct cggaggaccc tggaaaacag 2800
acgggtctga gactgaaaat ggtttaccag ctcccaggtg acttcagtgt 2850
gtgtattgtg taaatgagta aaacatttta tttcttttta aaaaaaaaaa 2900

<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<221> primer
<223> Forward primer for analysis of [...]
of TADG-15 mRNA by quantitative [...]
<400> 10
atgacagagg attcaggtac 20

<210> 11
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<221> primer
<223> Reverse primer for analysis of [...]
of TADG-15 mRNA by quantitative [...]
<400> 11
gaaggtgaag tcattgaaga 20

<210> 12
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<221> primer
<223> Forward primer for analysis of P[...]
expression by quantitative PCR.
<400> 12
tgcattgaca acgaggc 17

<210> 13
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<221> primer
<223> Reverse primer for analysis of [...]
expression by quantitative PCR.
<400> 13
[?]tgtcttgac attgttg 17



SEQ 15/17 
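As an illustrative check on SEQ ID Nos. 10 and 11, the following Python sketch locates the two TADG-15 primers on an excerpt of SEQ ID No. 1 (positions 2001 to 2150, copied from the listing as transcribed here) and reports the span they bracket. The helper names `reverse_complement` and `locate`, and the choice of excerpt, are assumptions made for illustration only; the sketch uses exact string matching and is not a description of the inventors' procedure.

```python
# Excerpt of SEQ ID No. 1, positions 2001-2150, as transcribed in this listing.
CDNA_2001_2150 = (
    "atgacagagg attcaggtac tcagacccca cgcagtggac ggccttcctg"
    "ggcttgcacg accagagcca gcgcagcgcc cctggggtgc aggagcgcag"
    "gctcaagcgc atcatctccc accccttctt caatgacttc accttcgact"
).replace(" ", "")
OFFSET = 2001  # listing position of the first base of the excerpt

FORWARD = "atgacagaggattcaggtac"  # SEQ ID No. 10
REVERSE = "gaaggtgaagtcattgaaga"  # SEQ ID No. 11

COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def locate(template, primer, offset):
    """Return (start, end) listing positions of an exact match, or None."""
    idx = template.find(primer)
    if idx < 0:
        return None
    start = offset + idx
    return start, start + len(primer) - 1


# The forward primer is searched as written; the reverse primer anneals to
# the opposite strand, so its reverse complement is searched on this strand.
fwd_site = locate(CDNA_2001_2150, FORWARD, OFFSET)
rev_site = locate(CDNA_2001_2150, reverse_complement(REVERSE), OFFSET)
amplicon_length = rev_site[1] - fwd_site[0] + 1

print("forward primer site:", fwd_site)
print("reverse primer site:", rev_site)
print("bracketed span (nt):", amplicon_length)
```

Exact matching is used deliberately: a primer that fails to match would point to a transcription error in either the primer or the cDNA excerpt rather than to a biological mismatch.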

<210> 14
<211> 242
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> Serine protease catalytic domain of TADG-15.
<400> 14

Arg Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp
5 10 15
Gln Val Ser Leu His Ala Leu Gly Gln Gly His Ile Cys Gly Ala
20 25 30
Ser Leu Ile Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr
35 40 45
Ile Asp Asp Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gln Trp Thr
50 55 60
Ala Phe Leu Gly Leu His Asp Gln Ser Gln Arg Ser Ala Pro Gly
65 70 75
Val Gln Glu Arg Arg Leu Lys Arg Ile Ile Ser His Pro Phe Phe
80 85 90
Asn Asp Phe Thr Phe Asp Tyr Asp Ile Ala Leu Leu Glu Leu Glu
95 100 105
Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro Ile Cys Leu Pro
110 115 120
Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala Ile Trp Val Thr
125 130 135
Gly Trp Gly His Thr Gln Tyr Gly Gly Thr Gly Ala Leu Ile Leu
140 145 150
Gln Lys Gly Glu Ile Arg Val Ile Asn Gln Thr Thr Cys Glu Asn
155 160 165
Leu Leu Pro Gln Gln Ile Thr Pro Arg Met Met Cys Val Gly Phe
170 175 180
Leu Ser Gly Gly Val Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro
185 190 195
Leu Ser Ser Val Glu Ala Asp Gly Arg Ile Phe Gln Ala Gly Val
200 205 210
Val Ser Trp Gly Asp Gly Cys Ala Gln Arg Asn Lys Pro Gly Val
215 220 225
Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp Ile Lys Glu Asn Thr
230 235 240

Gly Val 
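
The relationship between SEQ ID No. 1 and SEQ ID No. 2 in the listing above can be illustrated with a short Python sketch that translates the first 100 nucleotides of the cDNA and compares the result with the first 26 residues of the protein. It assumes, for illustration, that translation starts at the first `atg` in the excerpt and uses the standard genetic code; the names `CODON_TABLE`, `THREE_TO_ONE` and `translate` are hypothetical helpers, not part of the disclosure.

```python
# Standard genetic code, built from the conventional t/c/a/g ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID No. 1, positions 1-100, as transcribed in this listing.
CDNA_1_100 = (
    "tcaagagcgg cctcggggta ccatggggag cgatcgggcc cgcaagggcg"
    "gagggggccc gaaggacttc ggcgcgggac tcaagtacaa ctcccggcac"
).replace(" ", "")

# SEQ ID No. 2, residues 1-26, as transcribed in this listing.
PROTEIN_1_26 = (
    "Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp "
    "Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His"
)


def translate(seq, start):
    """Translate complete codons from `start` until a stop codon or the end."""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


start = CDNA_1_100.find("atg")  # assumed start codon (0-based index)
translated = translate(CDNA_1_100, start)
expected = "".join(THREE_TO_ONE[r] for r in PROTEIN_1_26.split())

print("start codon at listing position:", start + 1)
print("translated:", translated)
print("matches SEQ ID No. 2 (residues 1-26):", translated == expected)
```

The same comparison, extended over the full open reading frame, is one way to spot single-base transcription errors in a scanned listing, since a misread base usually changes the encoded residue.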
[Figure: amino acid sequence of TADG-15 (SEQ. ID NO: 2) in one-letter code, annotated as listed below; the sequence text and the annotation symbols are not legible in this copy.]

Annotations:
Conserved cysteine residue
Possible N-linked glycosylation site
Conserved SDE motif
Potential cleavage site
Conserved amino acids of catalytic triad H, D, S

Domains:
1. Cytoplasmic domain
2. Transmembrane domain
3. CUB repeat
4. Ligand-binding repeat (class A motif) of LDL receptor like domain
5. Serine protease

FIG. 10
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SEQUEHCE LISTING 

<liC> O'Brien, TiiTKDthy J. 

Tgniinoto, HirotosJii 

<i20> . TAC*G-15: Ail Sxtrac'-^llular St^-riris Protease 

<140> . FCr/DSSS'/0343S 

<141:> 1393-02 -IS 

<150> US 03 / 527 r 337 

<151> 1993-02-20 

<160> 14 . 

<:2:10> .1 
<211> 3147 
<212> mA 

<223^ cn-'IA a&£jaence. q£ TADS- 15 

tc-i?t5.g^gc5g cctcggggta cc a t-gg^j^j^ag cgatcgggcc cg-c^aaggg-og 50 

gagggggccc ga^iiggacttc: ggetrcgggiiir^ br^aAgt^CitA ct<2'S!-i:gg«r!AC IQQ 

g^LgasLaLgrtg^ - &tggc:ttgg3 ggasaggcgtg gagttcctgc cagtcaac^ia 150 

cgtc^agiaag gtgga^aagc stggcocggg gcgctgggtg gtigctggcsQ 200 

ccijcgctgat. cgg-r^ctctr^tc ttggtcttgc? tgggg'Sfccgg cttcctggtg 350 

t:gg.22Ltttgc ^.gtaccggga cgtgcgtgtc. cagasggtcc t.caatggcta 3 00 

C9.tg-5igg^.tc acaaacgaga at&ttgtgga tgdcta.rjgcig tiac tec aact 350 

ccactgsgrt. tgtaag-sctg gcc^agcia^gg tg&aggacgc gctgaagctg 400 

ctgcscagcg igAgtc-cc-aSitt ci^tgggcccq taccacaagg agtcggctgt 450 

gacgeccttc atgcgagggca gcgtcatcgc ct^cr-acrgg tctgagtt-ca 500 

g-s:atcccgca go^catggtg gaggaggccg -stgcgcgtcat ggccgagg^g 550 

cgcgtagcca tgrjtgc^cccc gcgggcgcgc tccctgaagt cctttgtggt 6 00 

caccucagtg gtggctttcc ccacggacto csaaacagta ca^gaggacac 6 50 

iggaca*cag ctgcagcttt ggcctgcacg cccgcggtgt gg^gctgatg 7 00 

cgcttcacca cgccdggctt ccc-tgacagc ccctaccccg ccc-atgcccg 75D 

ctgccagtgg gccctgcggg gggacgccga ctcagtgctg ag-cctcac^Jt 800 

tccgcagctt t^accttrgcg tcctgcgacg $gcgcggc&g cgacctggtg 850 

■acggtgtaca aci^-:::ci^tg5ig ccc^catggag ccccacgccc tgetgca^tt .900 

gcgtggca'irc t^iccctcccc cctac:aac::.ct gaccttcr^sic:' tcctcccagr-^ 950 

acgtc='?tgct caccacactg .staaeca&c^i ctgagcggcg gc^itcccggc 1000 

SBQ 1/17 



ttt:g.it-ggcc£2 cdttcttcca gctgcctagg 5.tg^i-gcagct gtggsggccg 105 0 

cttacgts^^^ gcccag-gg-g^ cs-ttcsacag cccctactac ccaagccact llOO 

acccacccae. cactgact^c acaccfgeaca ttgaggtigco caacfiaccag 1150 

catgtgiSi^g'S. tg^ig-cittcaa attctbrjtac ctg-ctgg&gc ccger^gtg-inc: 120C- 

tgcgggc&cc tgc-cJcca-agg ^ct^cgtgga gatcaatggg q^g^^^tB.ct 1250 

gcgga-gag^g gtcccagttc gt eg t caeca gcaacagcaa csagatciaca 1300 

gtucgcttcc aot^agatca gccctacacc gacaccggct. C-cttagctga 1350 

^t^cct'-tc:^::! t^-£!gac:tcca gtgd^c-rrcatg cccggggcag ttcacgtgcc 1400 

i:fcacggggcg gtgt^^tccgg a^iggagctgc gctgtgetgg .rtggg-idgac: 1450 

tgc^ccgacc s.-iiagcgatga gcceaa^^tgc agt-tgcgacg ccggccacca 1500 

gctcacgtgc ^^g^acaagt tctgcaa-gcc cmtcttctgg gt-r^tg-agac^ 15S0 

gtgtgaacea ctgcggagac aac^igcgaog sgcaggggtg c^gttgtQcg 1600 

gcccagacct tc!.iicgtgttci caatgggaag tgcctctoga assgocagca 1650 

gtgcaatggg ci^ggacgact gtgggg^cgg gtccgaegag gccstciitgcc 1700 

ccaawt:g-a^-3 cgtcgtcact tgtsccaaac .acacctaccg ctgcctca^t 1750 

gggctctgct tgagcaaggg caaccct-gag tgtgacggga aggaggactg 1300 

t9.gcgacggc tc^gatgaga aggi:Ctgcg& ctgtgggctg cggtc^ittCii 1650 

cgagacaggc tcgtgttgtt asf^ff^'-^^^^^sr acgcggatga gggcgagtgg ISOO 

cDdtg^dagg tiiAg.cctgc:^! tgctat^ggc Cig^sgccaca totgcg^gtgc 1350 

ttccctcatc tctccc^L^ct ggctggtcto tgccgcacac tgctacatdg 2000 

acgacagagg atccsggtac tcagacccca cgcsgtg^ac ggccttvctg 2050 

ggczzgc^cg accagsgcca gcgcagogcc cctggggt.gc aggagcgcag 2100 

gcitcaiii-gt::gc atc^&tctcrcc iiccccttctt CAAbgacttc accttcgacrt 2150 

^tgacatcgc gctgctggag ctgg^ig^^aac cggcagagta cagctccatg 22 00 

gtgoi^TCCca tctgcczgcc ggacgcoccc catgtcttcG ctgccggcaa 2250 

gcrcir^jtcrtgg gtcacgggct ggrggacacac cc^gtatgg^ ggca'^t^ggcg 23 00 

cgctgatcct gcaaasgg-gt gagatcc^rcg tcaccaacca gaccacctgc 2350 

gagaacctcc tgccgcagca gat-cacgccrg cgcatgatgt rjrcgtggQ.Dtt: 2400 

cotcag-sggc ggcgtgg.3i-£t cctgccaggg tgattccggrg ggacccctgt 24 50 

o.c.2i^ccirtgga ggcggatggg cggatcttcc aggccggtgt ggi^gagctgg 2500 

g.ga.gacggct gcgctcaga^T gaacaagcca ggci^t.gtaaa caaggc»:ccc 2550 

tctgtctogg gactggatca aagagaacae tggggts^t^g gggccgqggc 2600 

ca-2CC3!.a.iitg tgtacacctg cggggccacc catcgtccac cccagtgt-r;c 2650 

^cgcctgcati gctggacjaot: ggaccgctga ctgcadrj&.ga ^ccr-tfc^gaa 2? DO 

catacaotgt gaactea^tcr tcciigggctc caa.?:tc:tgcc tagai^aac-ct 2750 

■ntcgcttcct cagcctccaa ^gtgga^ctg ggaggtagaa ggggaggeca 2SC50 

ctggtggttc taccgaocca actgggegca aagt^tttgeua gaezadagcct 23S0 

cccccgccag cccc^£tgctg ggccgarjgcg cgtttgtgta t^tctgcctc 2300 




ccctgtctgt s^iagagrcagc ggg-^-acggaa ctt-iigg^gcc tcctcagtga 2350 

aggtrjgcggg gcrgccggac ctgggctgtg gggcc^-^ttttg gccacgctct 3000 

hs^ggaagcc caggctcgija ggaocctgga s^ac^gacgg gtct^iig-^^ct 3D50 

gaidiattgttt ta^ic^gctcc eigggtggac ttcagtgtgt gtatttgtgt 3100- 

aaatgggtaa -sacaatttac ttctttttii^ ^si^aaaaaad aaaaa.g.a . 3147 

<210> 2
<211>
<212> PRT
<220>
<223> Amino acid sequence of TADG-15 encoded by cDNA

Met. Gly 2er Asp Axg Arg Lys Giy CUy Gly Giy Paro Lys Asp 

• 5 iO . 15 

pji*^ Gly Ala Gly L.^w Lyis ^yr Asn Ser Arg His OLu Lye Vaa Asn 

20 25 30 

Gly L«u Glu Giu Gly Val Giu Phe Lau Pxo v^iil A^n A:sn val Lys 

35 4D . ^5 

Lys V^l Olu Lys h^ie Qly Pro Gly Arg Trp Vsl Val Le-u Aia Ala 

50 ' 55 ' 60 

val Leu 11^ Gly Leu Leu Leu Val Li?u Leu Gly lis Gly Ph^ Leu 



g5 7 0- 



75 



Val T^rp Hie L^5U Glxi Tyx Arg A^ip Val Arg V«l Gill Lys -Val Pha 

SO «5 £?0 

Asn Gly ''ryr- wet Ax-g lie T^ir Asn Giu A en Ph« Val A-gip Ala Tyx 

95 100 105 

Glu Aan Ser A^n S^x Thr Qlu Pina Val S<ar Lh^u Ala Ser Lys Val 

110 ai5 130. 

L\.^S Asp Al.a Leu Lys Leu L^a I^tTT 3^ Gly V^l' Pro Phe LeU Gly 

135 1:3 0 135 

Pxo T-iTT His Lys Glu Ser Ala Val Tiir A Is Phe £-ar Glu Giy S^r 

140 145 . 150 

Val I1& Ala -i^'r Ti-o: Trp S^r Glu Phe Ser II^ P^'-' 'Gin Hiig: L«u 

1$0 1S5 



Val Glu Ala Glu Arg Val Met: Ala Glu Glu Arg- V&l Val Met 

170 175 180 






L6>u Fro Pro Arg Ala Arg Ser 

IBS 

V^X V5I Pro Tiir Agp 

200 • 

Aep Asn Sar Oys. .S^ix FAtij Gly 

.2-15 

Met Arg PJifi Tlir Thr- Pro Gli^ 

Kis Ala Arg Cy*5 C3ln Trp' Ala. 

245 

Leu S^r L^u -thx Fhe Axg Ser 

Ai-g Giy Ser Asp Leu Vsl 'I^ix 

Ti T c: 

Glu Pro Kis *-.l5: Lau Val Gin 

.230 

T-^^t: Asn L«u Tbr Phe His Ser 
Leu lie Thr Asn Tin: Glu Arg 

"1 A 

Phe Phe Qln Leu Pro Ar.3 Met 

Lys x\la Glrx Gly Thx- A011 

350 

Pro PrD Asn- lie Asp Cy2 Thr 

•Gin His Val Lys vai Ser Phe 

Qly V2il Pro AI3 Qly Thr Cy^ 

Gly Glut Lys Tyr Cys Gly Glu 

410 

Aen Sar Asn Lys lie Thr Val 

425 ■ 

Tl-iX A3P Thr i^ly P'ne Leu Aia 

Asp Pro Cy^' Pre Gly Gin Phe 

4S5 



Leu LvG Ser Fhe Val Vsl Thr S-er 







130 










135 






Thr 


V£ll 


Gin Arg 


Thr 


Giri 
















210 


L'£U 


Hi2S 


Al^a. 


Arg 


Giy 




Glu 








AjSP 




Pro 


Ti/r 


Px-o 


U K« b* 






235 










•24 D 


Leu 


Arg 


Qly 






Asp 


Ser 








2 5D 


■ 








255 


Pile 




L-e-u 






Cyn: 




Glu 






265 










27 C 




•Tyr 




Thr 


L^Cs'ti 


Ser 


Pro 


Met 






230 
















^3ly 


Thr 




Fro 


Pro 


ber 






4^ v* v' 










2- 'DO 




Gin 


Asn 


V^lI 




Leu 


I1& 


Thr 






310 










315 


Arg Hi 3 


Pro 


Gly 


Phw 


Glu Ala 


TJir 
















330 




S«sr 


Cys 


C31y 


Gly .?*rg tn^v Arg 






3d0 










3-45 


S'er 


Pro 




Tyx 


Pro 


Gly His 


Tyr 






355 










260 


Trp 


A^ccti 


11^ 


G-lu 


Val 


Pro 


Aeri 








3 70 












Lys 


Phe 


Fhe 


Ty3: 


Leu 


Leu 


Glu 


Prg 




















Ly;^ 


Asp 




V£il 


GlQ 


He 








^00 










.4C-5 


Arg 


Stir 


Gin 


Phe 


val 


Val 


Thr 


Ser 






4X5 










420 


Arg 


Fhe 


His 


Ser A^- 


Glrj 


Ser 


•r>'-j; 






430 










43 5 




Tyr 


•Le.u 




Ty-^ 


Abp 










445 










.450 


Thx 


Cye 


Ar--g 


Thr 


Gly 


Arc? 


jr 


II45 






d.60 










465 






WO 99/42120



Arg hys Glu L-au Arg 

47 0 

Ser Asp Glu I^eu Asn 

C:;.':^- Ly^ Aan L-yg Fh-a 

Val Asn Asp Cya Gly 

• S15 

Pro Ala. Glr. Thx Pfcie 

. 530 

S^p-x Glxi Glu Cys ?v.5n 

545 

Glu Alii Ser O/d Pro 

Thr Tyr Arg Cys Leu 

57S 

'Glu' Cys A^:p QiY Ly3 

530 

Asp O/S Asp €yi^ Gly 

Val Gly c51y 'TTni: Asp 

620 

Ser Lrtsu Hi:^^ Ala L*u 

Ili=: Sex Pxo Asri Trp 

Asp Arij Gly P-tae Airg 

665 

Leu Qly LSU His A^- 

. Glu Arg Axg L^u Jsys 

635 

E'he TJir Phe Asp 'Tyx 

O ^} i-v, 

Ala Giu Tyr S^r S^x 
Ser Hi$ Val Phis Pro 



Cyi3 Asp Gly Trp AI3. 

Cys Ser Oys Asp Ala 

45 0 

Cy3 Lys ?ro Leu PhtS 

505 

As'p Asn Se-z Asp Ola 

520 

hxg Oys Ser Asa Gly 

535 

Gly Lys A;=p Asp CV^ 

550 

Ly:5 Viil A£.n V^.l Vai 

Asn Gly Lena O/s^ Leu 

. 530 

Gl-a Aep S'csr Abp 

595 

Lau Arg S6r P'li^ Thr 

SIO 

Ala Aep Glu Qly Glu. 

Gly Glrj Gly Hi^ ll-e 

•540 

L»u val JSar. Ala Ala 

-655 

•I^^r S-sr A^p Pro WiX 

610 

Gin Siiir Gill Arg S^x 

6B5 

Ar-g lie ll-s S^x- His 

70 '3 

Abp 11^5 A-l-a ILieu Leu 

715 

Mt2t val Ars Pro 1-1$ 

730 

Ala CUy Lys ALa lie 



Gly His Gin ?ha Thx 

4.S5 

Trp Val Cy^ A;«p S-sx 

510 

Gin Gly Cys Ser Cy^ 

525 

LyiS Cys leu S-er Lys 

540 

Gly Aj3p Gly Ser Asp 

Thr Cy^ Thx- Lya His: 

570 

£:&r Lys Gly Asn Pro 
■ • 535 
Gly Sar Aep Glu" Lys 

600 

Arg Gir:i Ala Arg Vstl 

615. 

Tx?!- Pro Txp ♦Sin V^l 

630 

Cys Gly Ala S^r Lieu 

645 

Hi^' eya Ti'-r lis A^- 

650 

Gin Trp I'hr Ala Phe 

Ala P'Xq- Giy V&l Qln 

Pro Phe Ph& Aan Agp 

■ 705 

{jiu Leu Glu Lys Pro 

720 

Oy^- L-eu Pro Asp Aia 

Tip Val Thr Gly Tl-p. 

75-D 






Gly 


His 


Thr Gin 




G-ly 


Gly 


ThX Gly 


Xla. 




life 






Ly3 








755 




■ 
















765 


Gly 


Glu 




Val 


lie 




•Gin 


Ttar 






Glu 


Asn 




Leu 








770 










/ 1 ^ 










Ton 


Pro 


Gin 


Gin lie 


Tiir 


Pro 


Arg 








V.2l 




































79S 


Gly 




Val-jTiSp 








Gly 


Asp 






























BQ5 










810 




Val 


Glu Aiai 


Asp 


Gly 


Arg 






•Gin 


J^l*:^ 


CUy 


Val 


Val 










815 




















e^s 


Trp 




Asp Gly 




Ala 


Gin 




Asn 


Lys 




Gly 




Tyr 


Thr 








«s- •> t1 










S3- 5 














Leu 


Pro Lfiu 






Asp 


Trp 


lie 


Lys 


Glu 






Gly 


V-cil 






£45 










S50 










6-55 



<211> 256 

<212> PRT

_ -^i i-t 

<221> DOMAIN

homologous to similar domain in TADG-15



<400> 3



Arg lie val Gly sly s^xq Asp Thic Sear tl&j. Gly Asrg Txp Pro Trp 

5 10 15 

•Gin V5l g-er Leu isxg Tyr Asp Gly Ala His Leu Cys Gly Gly S'er 

20 ^0 

h&u Lsu Sec Gly AsD T]cp Vsl Leu Thr Aia Ala His Cys Phe f ro 

35 40 45 

Glu Arg Atjn Arg Val Leu S«x Axg Trp Arg Val Phe <?iy Ala 

50 55 SO 

Val Ala Gin Ala Ser Pro Kii' Gly Leu Gin Leu Qly Val Gin Ala 

65 70 75 

Val Val Tyr Hie Gly Gly Tyr Leu Pro Phe Arg Asp Pro astj Ser 

8<3 ■ , . SS ■ . ^'^ 

Glvi Glu Asn ser Asn Asp He Ala L^su Val His Leu- Ser Ser Pro 

95 100 105 









L-eu Pre Len Thx Oiu Tvr Il6 Gin Fro Vai Cys Leu Pro M-:* Ala 

no 112 120 

Gly Gin Ala Lex; val Asp Gly Ly© ll« Cys Tht Thx Qly Txp 

125 1-30 135 

tjlv Asn Tlir Gin Tyr Tyx '3ly Gill Gin Aia Gly Val Le^i Girj Glu 

140 145 150 

A1& Arg Veil Pro Ilia lie Ser Asa Asp Vsl Cya Asa Gly Al-a Asp 

155 iSD ISS 

Phsi Ti-r Gly &sn Gin He hys ^-y^ M«t Fhe Cys -Sly I'i'r 

170 175 ISO 

Pro Gin Cly Qly He A^p Ala C^--^ Glr. Gly Asp Ser Gly Gly Pro 

185 190 155 

V&l Cys Giu Asp Sftr He Sex Arg Thx. Ars T"p Arg L.«u 

200 205 2-10 

cys Gly lie Val Sex Trp £3ly Ttor Gly Cya Ala ieu Ala Gin :Lys 



215 2-20 



pro Gly Tyr Tin Lys Vai Ser Pha Ax^ Glu Isp lia Phe 

230 235 . 240 

Gin Ala He Lys Thr His Sar Glu Ala Sex Gly Ket Val Tiii- -aln 

jd5 250 .25!> 



Leu 



<210> 4 
<211> 225

'"•i ^ "Jl ' t 



^ TJ n ~: 



Serine protease catalytic domain of Scce
homologous to similar domain in TADG-15
<400> 4

Lys- He lie Asp Gly AJ.a Fxo Cys ?aa Arg Gly Sex Kia Pro itcp 

5 10 15 

Gin Val Ala Leu Leu Sex Gly Asn Gin Leu His Cys Gly -Gly Val 

20 25 30 

Leu Val Asn Glu Arg Txf' Val Leu Tta Aia Aia KiS Cys Lys Mst 
.35 43 









PCT.'US9WQ343* 



hsn Glu Tyr Tiir val Hi£ L^U Gly S&r Asp Th^r Lea Gly Asp Arg 

50 55 SO 

Arcr Alii Gij3 Arg He Lys Ji.15 S«x Lye Ser Phe Ajrg Kis fxo C-iy 

65 ■ 70 75 

Vyc &er Thr Gin Thr Hii Val A^n Aap L.eu Hfet L^jl Vtil Lya Leu 

$0 S5 50 

Asn S«s Gin Ala :-^g Jbeu Ser Sex Me-t Val Lys Lys V.al Arg Leu 

35 100 105 

Pro Sex Arg Cys 01m Pro Pro Gly Thr Thr i^ys Thr Val Sex Gly 

110 115 120 

Trp Gly Thr Thr Thr Ser Pto A^p V^al Thr Phfe Pro A^ l^eu 

125 130 135 

Met • Cys val Asp V-al Lys Leu Il« S«r Fro Gin Asp Ojs TJir Lys 

140 ■ 145 i5D 

Val Tyr Lye Ajsi?- Le»a Leu Olu As-ii Sex- M«t Leu Cys Ala Oly lis 

155 . ■ . 

Pro Asp S^r Lys Lys A^n Ala Cys Asn Gly ?^-^p S&r Gly Gly Pro 

170 17 5. 130 

Leu Val Cys Arg Gly Thx Leu Gin Oly Leu Y-al Ser Tcp Gly Thr 

185 13f 

Phe Fro Cyjs'cly Gin Pro P.en P.ep Fro -Sly Val Ty-r Thr Gin y^l 

200 2T>5 210 

Cye Lys Phe ihr Lys Trp Ha Asn Asp Tlir Met Lys Lys Els Arg 

ois 220 



<211> 225
<212> PRT
<220>
<223> Serine protease catalytic domain of trypsin
(Try) homologous to similar domain in TADG-15
<400> 5

Lys Ha Val Gly Gly lyx Asn Cys Glu Glu Asn S.ar Pro Tyr ■ 

5 10 IS 

Gin Val Sfer Leu Agaii S<?:r Qly Tyr Kis Ph« Cys Gly Gly B^r Leu 

20 25 30 






11 fi Asn 'Slu Tr^- Vai Val Ser Alii Gly His Cy:s Tyn Lys Ser 

35 'iO .<1S 

Arg lie Gin .V£il Leu fAy Glu His Asn lie Glu V-ai L^i.a Giu 

5i> 55 60 

Gly Asn Glu GItj Fhe life A^n Ala Ala Ly$ ll<^ He: Arg, His Pro 

$5 10 75 

Glrj Tys- A^ip Arg J-ys Siir L«u Asn Asn A.Gp He M^t L«u Il« Lys 

BO as 20 

Leu Ser Ssr Arg Ali Vai lie A^n A1& Arg Val S*r Tlir lln S&r 

35 100 105 

Lfeii Pre? ThT Ala Pri> Pro Ala Thr Gly Tbr Lys Q,'S Leu Il«a E«r 

110 115 ■ 120 

Gly rxp Gly Asn Tbr Ala Ser Ser Gly Ala Asp T^t Pro Asp <51u 



125 130 135 

Leu Gin Cys L^ju Asp- Ala Pro Val Leu Ser -oiii Ala Lyis Ci-'S Glu 

1-30 145 ISO 

Ala Ser Tyr Fro Gly Lys 11& Thr Sex Asn I4&t Phe Cys Val Gly 

155 150 155 

Pfcie L«a Glu Gly Gly Lys Asp Ser 'C^rS Oln Gly Asp Ser Oly Gly 

170 175 ISO 

Pro Val vai Cya Asi-^ Gly Gin Leu Gin Gly Viil val Ser Txp Gly 

135 ' 190 

A,sp Gly Cy3 Als Gin Lys Asn Lys Pro Gly Val Tyz Thr Lye Val 

"•no 205 210 

Tvr Asj:^ Tvr V^il l-.yH'= T'rp He Lys- A^n Tiar lie AI3 Aisn Ser 

220 -225 

<211> 231
<212> PRT
<213> Unknown
<220>
(Chymb) homologous to similar domain in TADG-15
<400> 6

Arg He Val A^r. Gly Glu Asp Ala V.11I Pro- Gly S^ir T-rp Pro Trp 

5 10 . 15 









Gill Val Sar Lau Gin Asp hy& Thr Gly Phe His Phs Ch/e Qly Gly 

2.5 30 
Ser Leu lis S'ei- t3iu Asp Trp Val Val Thr Ala Ala Kis Cys Gly 

35 40 <3 5. . 

V-al Axa Tiir Ssr A:3i> y.al Val V£tl Gly Glu P>ie ?-sp Gin Gly 

S'^x Asp' Oiu Olvi Asn He Oln Val L&d Lys He Ala Lys va.1 Pha 

65 '50 

Lya Asn Pvo Lys Phe Ser lie Leu Thr Val Aen Asn Asp He Thr 

SO ■ &5 30 . 

L«u L<i!ii Lye Leu Ala Thx Pro Al£t Arg Pha S^sr Gin Thsr V-al Sex 

35 iOO 105 
Ala Val Cys Le-ii Pro Sex Ala Asp Asp A.sp fti« txo Ala Gly Thx 

llii lis 12C' 

Ltio eys Ala Thr IJhx Gly Trp Qly Lys Thx Lys Tyx ?.bii Ala Asn 

12s ' 130 12.5 

Lys Tlir Pro Asp Lys Leu Gin Glu Al^ Ala Leu Pro L«u Ley Ser 

140 . 145 ■ 150 

Aar. Ala Glu O/s Lys Lys Ser .Trp Gly Arg Arg He Tbx- Asp Val 

155 ' 160 • 165 

Met He Cys Ala Gly Ala £6;^ Gly Val sstr ser cyi' Hat Gly Aep 

170 . 175 130 

ser Gly qly Pro Leu Val Cys Gin Lys Asp Gly Ala Trp Thr Lau 

1S5 190 

val Gly II* Val Ser Trp. Gly Ser Asp Tlir Ci'B Ser Thx Ser Ser 

200 205 21D 

Pro Gly Val Tyr Ala Arg vai Thr Lys Leu lie Pro Trp Vil QXn 

215 

Lvs lit! Leu Ala Ala Asn 

230 

<210> 7
<211> 255
<212> PRT
<213> Unknown
<220>
<221> DOMAIN
<223> Serine protease catalytic domain of factor 7
(Fac7) homologous to similar domain in TADG-15









<400> 7



Ax^gr lie Val Gly Gly Ly^ Vdl Cy« Pro Lye Gly Qlu Cys Px'o Trp 



5 , ■ 10 . 15 

Gin Val Leu Leu Leu Vai hsn Cly Al^ Gin Leu Cy^ Gly Gly Thx 

20 55 30 

Leu lie Aen Thx lie Tacp V^l Vai Ser Ala Ala His O/s Phe Asp 

35 40 4S 

Lys He Lya A^n Trp- Arg Asn L^u He AI3 Val Leu Gly Glu Hijg 

50 55 50 

Asp Leu Sex -olu Hi a Asp Gly Asp Gla Gin St-r Arg Arg val Alia 

65 70 75 

Gin V«l Il« rae Pro Ser Thr Tyr- val Pro Gly TSir Thr Asn His 

BO . 85- ■ 

Asp II& Kl& Leu L^u Arg Li^ Kis Gin Prro Val Vial Leu TJir Asp 

. $5 100 
Hie V3I V^l Pvc? Leu Cye Lei Fro Glu Arg T'n.v Phe Sex Glu. Arg 

LIO 115 12D 

Thr Leu Ala Phe val Arg- Phe Ser L^u Val Se-r Gly Trp Gly Gin 

125 130 135 

Leu Leu Asp Axg Gly Ala Tlir Ala Leu Glu Leu Mst V^il Leu Asn 

140 145 150 

V&l Prg Arg Lsu Met Thr Gin Asp Cys Let! Gin Gin S«r Axg Lys 

155 160 165 

Val. Gly Asp Ser Pro Asn He Thr {?lu T'l.rr M«t Phe Cys Ala Qly 
. . 170 175 180 

T-yr S<5x Aap Gly Ser Lys Asp Cys Ly-s Gly A$p Ser Gly Gly 

1S5 155. 
FTD Ai* Thr Hie 'Vyr /ixg Gly Thr Txp Tyr Leu Thr Gly He 

200 205 210 

Val Sex Txp Gly Gin Gly Ojs Als Thr val Gly Bis Pie Gly Val 

215 220 • . 225 

lyx Thx Arg Val Sex Gin Tyr He Glu Trp Leu Glu Lys M«t 

23 0. 235 240 

Axc! Sfer Glu Pro Axa Fxi> Gly Val Leu Leu Arg Al* Pro Phe Piro 

245 250 255 

<210> 8
<211> 253






<212> PRT
<220>
<221> DOMAIN
similar domain in TADG-15

Arg lie Lye Gly -Sly Le-u Pb-s AJ.^^ Aisp lie Aia Ser Kis Pro T:i-^p 

5 .10 15 

Gin Ala Ala il-st Pb-a Ala Lys His Arg Ar.^ Sar Prp Gly Glu Axg 

20 25 3 0 

File Leu Gyb Gly Giy Il« ^^^^^ 2:ie Ser Se-r Cy^. Trp Ila L&u S-ar 

1 c. 40 -3 5 

Ala Ala Kia Cys PTia Gin Glu Arg Phe Fro Pro Hie His Lau Thx 

50 55 5l> 

V-51.1 lie h&^ Gly Arg Thr 'Ti-'r Arg Val Val Ftq Gly Giu Glu <3lu 

65 ■ ' 70 75 

Gin Lys Fh© ^3lu val Glu Lys T'yr He Val His Lys Glu Phe Asp 

30 .65 90 

Asp Asp Thx Tirr Asp Aen Asp He Al* Leu Leu Gin Lfe-u Ly3 S^i: 

95 IQP 105 

Asp Sier S^sr Ajrg Cys Alii Gin Glu S^sr S-isr val Val Arg Thr Val 

110 115 1^0 

CyjB Leu Pro Fro Ala As& Leu Gin L^ij fxo Asp ircp Thr cilu Cys 

125 i30 135 

Qlu Leu Sea: Gly Ti'r iSly Lys His Giu Ala L-eu Ser Ftto Ptte "Tj^r 

140 145 150 

Ser Glu Arg Leu Lya -Qlu Ala His Val Axg Leu Tys ^xo Sex Ser 

155 .3.60 165 

Arg Cye Thr Ser Gin His L<iu Leu Asn Arg Thr val Thr Asp Asn 

170 ns. 180 

Met Lau Cys Ala Gly Aisp Thr Axg Ser -oly Gly Pro Gin Ale Asn 

185 ISO 1^5 

Lssu Kie Asp Aia Cys •31ri Gly Asp Ser Gly Gly Pro L«u Val Cys 

200 205 210 



SUBSTITUTE SHEET (RULE 26)






L«u A£.n Asp Gly Arg M«t Thr Le-a Val Gly He lis Sfe^r Trj:. Gly 

215 220 225 

L^su Gly Cys Gly Gin Lys Asp- val Fro Gly Val lyx Thr Lye Val 

230 :^3-5 2^0 









250 


■ 
















«L* ^ r 






















* 














tf '? 0 A*v 












^ «U ^ — • 




mP^MSt Ee<5?aence (tj20423) 






A n fj ■> 














1„ y 1^ '^'^ »•»• "•s? — 


cgtgctgafcc 


ggcctcctct: 


tggccttgct 








III _^ 

ggcattt^'-SL 


gt^auo^^c^nsc 


CTtgc-gtatcc 


100 




.'^ nn^ t&c 


atgaggatca 


caaa i!igiaga.a 


ttttgtggat 


, 150 






cactgagttt 


1 ^HM IW 

gta^gcccgg 


ccagcaaggt 


2D0 








•agcc ccact-G 


ctgggcoci-cit 


2 D 




a* ^ s" y i^a 2^ 


^•:;:ggcctt.C'3 


1^ ^ l^^a B^^i 

goga«jggoau 










csitc-ccgcag 


•:2A<::ctggttg 


5:gg3«ggQC5a 


350 


»*p i*r .-H r 4* 3 ^ i"^ 




cg't-igtcratg 


ctgcccccgc 


gggcgcgcLc 

41 4> IT W 


400 




L» 'o'^ I- wjj 


cctv^'gtggt 


iJQfCt.t.t.CC-CC 


acgga-^-ticc^i 


450 


;^ s a r*^«'*j f i^r'^a. 


• ' ^ S3 




goag-iJtttcrg 


•r^ctgcAcgcc 


500 


i^*^^-TfT H i-S'-t-t~T.*T;:T 


'aji •>«• -*y ^ V 




cggcttcc^t 


gacagcccct 


550 






dngtgggctg 




acgcagtgct 


600 


(j^HHShI >_i> O'^^* 'ta m Vj^ 




cagctcgaot 


gcgir^ctegjeic: 


gagcgcggca. 


€■50 


gCy -iloC- C^'^ 




acac-dctg^g 


cpccatggag 


ccccacgoot 


7 DO 








acaacci-gac 




75D 


c tec cacga^ 


cgtcctgctc 


atcacaccg-ai 




tgrscgcggca 


300 


t-cccoac'ttt 


gagg<:cracct 


tcttccagct 


crcctaggat^f 


acjoag-ctgte 


S50 


ga-ggccgctt 










900 








tggsiaaattg 


aggtgcccaA 


S-SO 








cttctacctg 


ctggagGCcg 


1000 


gogtgcctgq 


gi?gcacGtgc 




acgtggagat 


caatgr^ggag 


1050 




gagatgeiogtc:* 


ecagttcgtc 


gtcac-i:a$-ii^ 


acagcaacaa 


1100 




cgcttccact 


cagatcagtt^ 




accggcttcrt 


1150 


tacfotgaata 




gactdcagtg 


acccaci3ccc 


ggggcagttc 


1200 






tatccgga^ag 


gaijctgcgcci 


gtgatggctg 


1250 






ggcgsctgca ccgaccacag cgatgagct-c aact-gcagGt gcfjacgco^SQ 13 00 

ccaC'::agttc: acgtsc-^ag^i g.ii^.a-igttctg Ciias.::itcttc tgggtctgcg 1350 

acagcgtg^a- cgsgtgcgga g-5^'-3s<=^^c:g acgagicaggg ttgc^tttgt IJOQ 

ccgg-scocag acct-tcaggt gttccaatgg gaagtgcctic t.c.ijaaaet?^3c 1450 

5igc-^5tgcaa tgg-g&aggac g^ctgtgggg acgggtccgs cgsLggc^ctcc 1500 

tgccccaagg tgaacgtcgt cacttgtacc 5j©ac.3,cacct ^ccgctgcct 1550 

eaatgggctc- tgcttgagca agggce^accc ugagcgcgac gggaa-5jg.3.gg 16 00 

actgtaecga cggctc^agat gaga-^i^gg^iCt .3.£gact.gtgg g-stgcggtca 15 50 

ttcadg^ga^c a^ggctcgtgt tgttgggggc ^^cggg^tgcgg -^itg-Jtgggcg.SL 17 00- 

gcggccctgg caggtaagcc t.gc^scg.ct-ct GQSCcagggc C5C«tctgc9 17 50 

gt'^cttccct ci^tct^^acc ^^atgqctgq tatctq^cgc ^^c^ctgct.^^c 1200 

atcgatgac5L g^gg^tttc^g gtactc^gac' cccacgcagg .^icggcvttcc 1550 

t^ggcttgca cgaccagagc cagcrioag^rc cctggggtgc ag^-^i^cgcag 1900 

gct.caag<--gc atcatct-ccc acoccttctt oa^^tgacttc accttcgacc 19S0 

atgaqatcgc gctgctggag .:2tgg^g,-iAac c^gg^rizigiigta c^igctccatg! 20 0*0 

gtgcggeci-r^a tctgcc'tgc=c ggacgcctgc «tgtc^ttcc ctgocggr-aa 20 SO 

ggccatctgg gtcacgggct ggggacacac ccagtatgga sscactggcg 2100 

cgctgatoct gs^asi^gggt g^g^itccigir^g tic^t.-2iadcia giiocaoctgc 2150 

g^ga-acctc^; tgccgcagca gatc3cgccg cgcatgj^tgt g':::gt5ggr::tt 2200 

cctcagcggc ggcgtggact cctgccsggg cgsti^ccggg ggacccctgt 2250 

ccagcgcgga ggcggatiggg cggatcttdc igg.:::ciggtgt ggcgsgctgi? 2300 

ggdsgacgctg cg^Jtcsigagg aiacaagccag gcgtgt^acsic di£iggctcc5ct ->350 

c tg t ttcggg t gga tcaa agag^isc^ ct ggt^tatagg ggccggggci.:^ 2400 

acccaaatgt gtacacctgc gqggcc^cco atcgtocacc ccagtgtgca 2450 

c:gcctg.:2^igg ct-gg-^g^ctc gcgc3ccgtg. acctgcacca ^cgccccQ.ga 2500 

scatacactg tgaactcatc tccaggctca aacot.gct.ag aaaacctctc .2550 

gctccctcag cctccaaagt gg ngctgggn gggtagci^igg ggaggaacac 2600 

tggtggttct J3ictgac':^caa ctggggcaag gtttgaagca c^gcwcggc 26S0 

^gcci:?adgtg ggcgaggacg cgtttgtgca cactgccctg ctct5:tacac 2700 

■ggaagacctg gatctcragt gaetgtg-^ict g-acgg^itctg actgcggtcc 2750 

t.tggccacg-2 tt-:rttgagga s-gccc-^ggct cggaggaccc tgg^iai^acag 2S0*u' 

acgg-srtctga aactgaa&at ggtttsiccag ctcccaggtg acttc^gtgt 2350 

gtgti^ttgtg tsaatgagi:a aaacatttta tttcttttaa a«aaaaaaaa ^yOv 



<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence
<220>


















analYs^is of cn/sx^z-csi-re 






of TADG-15 mRNA by quantitative PCR.

<400> 10














<210> 11
<211> 20






<212> 








<213> Artificial Sequence


- 


« "-S *> l""! 








-i -1. 










Reverse primer for




£:.S:iQ3a 






quantitative PCR.

<400> 11










0 




<210> 










17 






<212>










Artificial Sequence 






<220> 








<221>








O O O 1.^ 
*« Ji* J 


Forward primer for
analysis of β-tubulin
expression by quantitative PCR.
<400> 13 



<210> 13
<211> 17
<213> Artificial Sequence
<220>
<221> primer
<223> Reverse primer for analysis of
β-tubulin expression by quantitative PCR.

<400>


* 

12 ■ 



ctgtcttgac attgfccg ^'^ 









<210> 14
<211> 242
<212> PRT
<213> Homo sapiens

<220>

Arg Val Val Gly Gly Thr A^sp Aia Asp Glu Gly Glu Trp Pro T'srp' 

5 10 15 

Gin Val S-sr Leu His Ala L&u Qly Sin Gly Plis 11^2 Cys Gly AI5: 

20 2 5 30 

S'&x Leu 11^ Ser p-ro Asn Trp Leu Vul S^x AI^l AI^l Hi;5 Cys Tyr 

3 5 iC' 45 

11^ Asp A:=.p Arg C51y Pha Arg Tyx Seir A.5p Pro Thr Gin Trp Thr 

50 . 55 $0 

Ala Ph6 Leu Gly Leu His A^ Gin S-ar Gin Arg S^x Al^ Pro Gly 

. ■ ■ 65 . 70 "JS 

Val GItj i31u Axg Arg Leu Ly^ Arg lie lie ser His Pro Pfai* Pb$ 

SO 65 -50 

A-sn A$p Phe Thx P'ne Asp T^-r Ass-p 11'$ A.I5. Lau Giu l^u Giu 



95 



100 10: 



Lys Pro Ala Gl\j! Tyr S-ax Sax llet Va.1 Argr Pro xl^ Cys L^u Pro 

110 . lis ■ 120 

Asp Ala Ser His Val Phe Pr^? Ala. Gly Lys Ala He T^p Val Tiir 

125 130 1J5 

Gly Ttcp Gly His Thr* Gin lyi: Gly Gly Tnr Gly Ala Leu lie Lciu 

140 145 150 

Gin Lys Gly Giu He Arg V.al He Aan Gin Thr Tiir Cy-s Ciu Asa 

' 155 ISO . 165 

h&u Leu Pro Gin Gin He Thr Pro Aorg Met Met Cys Val Gly Pha 

170 . 175 ■* lao 

L^u £er Gly Qly V^l Asp £er C^s Gin Gly Agp Sex Gly Oly Pro 

135 190 135 

L^a Ser Ser val Giu Asp. Gly Arg 11^ Fh0 Gin A.la Gly Vai 

200 205 210 

Vai S^x Trp- Gly Asp -Sly €yz Ala Qin Axg Asn Lys Pro Gly Val 

215 220 225 




